Abstract 

Consider a graph on randomly scattered points in an arbitrary space, with 
two points $x, y$ connected with probability $\phi(x,y)$. Suppose the number of 
points is large but the mean number of isolated points is $O(1)$. We give 
general criteria for the latter to be approximately Poisson distributed. More 
generally, we consider the number of vertices of fixed degree, the number of 
components of fixed order, and the number of edges. We use a general result 
on Poisson approximation by Stein's method for a set of points selected from a 
Poisson point process. This method also gives a good Poisson approximation 
for Poisson U-statistics. 


1 Introduction 

In the inhomogeneous random graph (IRG), each vertex has one of several possible 
types or states, where the space of possible states may be infinite. Given the states 
of the vertices, each possible edge is present with a probability that depends on 
the states of the two endpoints, independently of the other edges. This provides a 
very flexible class of random graph models; for example, the state of a vertex could 
represent spatial location, or it could represent the time at which a vertex is born, 
for a randomly evolving graph. This class of models was introduced in [20], and has 
been studied in depth, for example in [4] and [2]. Much of the literature is concerned 
with the birth of the giant component, but full connectivity has also been studied 
in [6]. 

For our purposes, the IRG is defined as follows. Let $(\mathbb{X}, \mathcal{F}, \mu)$ be a probability 
space (the state space). For $s > 0$ let $\mathcal{P}_s$ be the (random) set of points of a Poisson 
point process on $\mathbb{X}$ with mean measure $s\mu$. Also, for $n \in \mathbb{N}$, let $\mathcal{X}_n$ be the binomial 
point process consisting of $n$ independent random elements of $\mathbb{X}$ with common 
distribution $\mu$. 

Suppose $\phi : \mathbb{X} \times \mathbb{X} \to [0,1]$ is a measurable symmetric function; we call such a 
function a connection function. Given finite $V \subset \mathbb{X}$ (possibly with multiplicity), let 
$G(V, \phi)$ be the random graph with vertex set $V$, with each pair $\{x, y\}$ of points of $V$ 
connected by an edge with probability $\phi(x,y)$, independently of all other pairs. We 
are particularly interested in $G(\mathcal{P}_s, \phi)$ and $G(\mathcal{X}_n, \phi)$; we define these graphs more 
formally in Section 4. 
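
The informal definition of the random graph on a finite point set can be illustrated by a short simulation. The following is a minimal Python sketch and not the formal construction of Section 4: the helper names `sample_irg` and `count_isolated`, the choice of uniform points on [0, 1) and the hard-threshold connection function are all illustrative assumptions.

```python
import random
from itertools import combinations


def sample_irg(points, phi, rng):
    """Return the edge set of one realisation of the graph on `points`.

    Each unordered pair {x, y} is joined independently with probability phi(x, y).
    Edges are stored as index pairs (i, j) with i < j.
    """
    edges = set()
    for i, j in combinations(range(len(points)), 2):
        if rng.random() < phi(points[i], points[j]):
            edges.add((i, j))
    return edges


def count_isolated(n_points, edges):
    """Number of vertices of degree 0 (the quantity D_0 of the text)."""
    touched = {v for edge in edges for v in edge}
    return n_points - len(touched)


# Hypothetical example: binomial process of n uniform points on [0, 1),
# with connection function 1{|x - y| <= r} (a random geometric graph).
rng = random.Random(0)
n, r = 200, 0.02
points = [rng.random() for _ in range(n)]
edges = sample_irg(points, lambda x, y: 1.0 if abs(x - y) <= r else 0.0, rng)
print(count_isolated(n, edges))
```

Repeating the last three lines with fresh seeds gives an empirical distribution of the isolated-vertex count, which can be compared informally with a Poisson law when the parameters are tuned so that the mean count is of order one.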

In the special case where $\mathbb{X}$ is a region of Euclidean space and $\phi(x, y)$ is 
determined by the displacement $x - y$ (typically via $\|x - y\|$, where $\|\cdot\|$ denotes 
the Euclidean norm), the IRG is also known as the soft random geometric graph 
m or random connection model (RCM) [15]. If, in the Euclidean setting, we have 
$\phi(x,y) = \mathbf{1}\{\|x-y\| \le r\}$, the IRG is known as the random geometric graph (RGG) [16] 
or Gilbert graph. These models are important in applications to wireless 
communications; see for example mmmm 

For any graph $G$ and any $j \in \mathbb{N}_0 := \mathbb{N} \cup \{0\}$, let $D_j(G)$ denote the number of 
vertices in $G$ of degree $j$; also set $D_{\le j}(G) := \sum_{i=0}^{j} D_i(G)$. In particular, $D_0(G)$ is 
the number of isolated vertices. Of interest is the question of whether $D_{\le k}(G(\mathcal{P}_s, \phi))$ 
and $D_{\le k}(G(\mathcal{X}_{\lfloor s \rfloor}, \phi))$ are approximately Poisson distributed for $s$ large, with $k$ fixed, 
e.g. $k = 0$. One reason for interest is that $D_0(G) = 0$ (respectively $D_{\le k}(G) = 0$) 
is clearly a necessary condition for $G$ to be connected (resp. $(k+1)$-connected), 
and for many choices of $\mathbb{X}$ and $\phi_n$ this condition is asymptotically sufficient (in 
probability) when $G = G(\mathcal{X}_n, \phi_n)$ with $n$ large; see [19, THUS, TTlf9]. In such cases, 
$\mathbb{P}[D_{\le k}(G(\mathcal{X}_n, \phi_n)) = 0]$ is a good approximation for the probability that $G(\mathcal{X}_n, \phi_n)$ 
is $(k+1)$-connected, so if we can estimate the former via Poisson approximation, 
we may have a useful estimate for the probability of $(k+1)$-connectivity. 

The references just mentioned show that in many cases where $s$ is large and $\phi$ 
is chosen so that $\mathbb{E}\, D_j(G(\mathcal{P}_s, \phi)) = O(1)$, the distributions of $D_j(G(\mathcal{P}_s, \phi))$ and of 
$D_j(G(\mathcal{X}_{\lfloor s \rfloor}, \phi))$ are approximately Poisson. In this paper we give a general criterion 
for this to be the case, without making any geometrical or topological assumptions 
on the space $\mathbb{X}$ (Theorem 2.1). We give a similar result for the number of components 
of order $k$ with $k$ fixed (Theorem 2.2), and for the number of edges (Theorem 2.3). 

We prove Theorems 2.1, 2.2 and 2.3 in Sections 5, 6 and 7 respectively. The proofs 
use Theorem 3.1, a general result on Poisson approximation for functionals of Poisson 
processes, which is of independent interest. This theorem also gives us further results 
on the Poisson approximation for the number of edges under different assumptions 
from those of Theorem 2.3. See Theorem o and the subsequent discussion. It 
also provides an alternative way to derive (and slightly improve) a result of [5] on 
Poisson approximation of Poisson U-statistics; see Section El 


2 Results on inhomogeneous random graphs 

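Let $\Phi$ be the class of all measurable symmetric functions from $\mathbb{X} \times \mathbb{X}$ to $[0,1]$, and 
for $\varepsilon > 0$ let $\Phi_\varepsilon$ be the class of all $\phi \in \Phi$ that satisfy 

$$ \inf_{x \in \mathbb{X}} \left( \int \phi(x,y)\,\mu(dy) \right) \ge \varepsilon \sup_{x \in \mathbb{X}} \left( \int \phi(x,y)\,\mu(dy) \right). $$

If $\phi \in \Phi_\varepsilon$ we say the connection function $\phi$ is $\varepsilon$-homogeneous. Note that 1-homogeneity, 
according to our definition, is the same as homogeneity as defined in [6]. 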

Several of our results require $\varepsilon$-homogeneity. There are many interesting classes 
of connection function which satisfy this condition. For example, suppose $\mathbb{X}$ is a 
bounded convex region in Euclidean space $\mathbb{R}^d$, and $\mu$ has a density (with respect to 
Lebesgue measure on that region) that is bounded away from 0 and infinity. Then 
there exists $\varepsilon > 0$ such that all connection functions of the form $\phi(x,y) = \psi(|x - y|)$ 
with $\psi$ nonincreasing lie in $\Phi_\varepsilon$. In particular, we do not require any exponential 
decay condition on $\psi$, such as that imposed in im 

For $\phi \in \Phi$, set $\bar\phi := \sup_{x,y} \phi(x,y)$. Suppose we have connection functions $\phi_s$ 
defined for each $s > 0$ with $\mathbb{E}\, D_{\le j}(G(\mathcal{P}_s, \phi_s))$ bounded as $s \to \infty$, for some fixed 
$j \in \mathbb{N}_0$. See (5.1) below for a formula for $\mathbb{E}\, D_j(G(\mathcal{P}_s, \phi_s))$. Our first result shows 
that $D_j(G(\mathcal{P}_s, \phi_s))$ is approximately Poisson for large $s$, under the condition that 
$\phi_s \in \Phi_\varepsilon$ for all $s$, along with an annoying extra condition that $\bar\phi_s$ is bounded away 
from 1. 

For $\alpha \in (0, \infty)$, let $Z_\alpha$ denote a random variable having the Poisson distribution 
with parameter $\alpha$. For any graph $G$ we define $D_{\le -1}(G) := 0$. 

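Theorem 2.1. Let $j \in \mathbb{N}_0$. Suppose that for some $\varepsilon > 0$ and all $s \in (0, \infty)$ we have 
$\phi_s \in \Phi_\varepsilon$ and $\bar\phi_s \le 1 - \varepsilon$. As $s \to \infty$, if $\mathbb{E}\, D_{\le j}(G(\mathcal{P}_s, \phi_s)) \to \alpha \in (0, \infty)$, then 

$$ D_j(G(\mathcal{P}_s, \phi_s)) \xrightarrow{\mathcal{D}} Z_\alpha; \qquad D_j(G(\mathcal{X}_{\lfloor s \rfloor}, \phi_{\lfloor s \rfloor})) \xrightarrow{\mathcal{D}} Z_\alpha, \tag{2.1} $$

and also 

$$ D_{\le j-1}(G(\mathcal{P}_s, \phi_s)) \xrightarrow{\mathbb{P}} 0; \qquad D_{\le j-1}(G(\mathcal{X}_{\lfloor s \rfloor}, \phi_{\lfloor s \rfloor})) \xrightarrow{\mathbb{P}} 0. \tag{2.2} $$

It is interesting to compare the conclusion of this result with the example on 
page 55 of [16]. In that case, for a certain sequence of RGGs one can arrange for 
the number of vertices of degree 2 to be asymptotically compound Poisson, whereas 
here it is asymptotically Poisson. 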

For $k \in \mathbb{N}$, and for any graph $G$, we refer to the components of $G$ of order $k$ 
(i.e., with $k$ vertices) as the $k$-components of $G$. Let $N_k(G)$ denote the number of 
$k$-components in $G$. In particular, $N_1(G) = D_0(G)$. 

Suppose we have connection functions $\phi_s$ defined for each $s > 0$ with $\mathbb{E}\, N_k(G(\mathcal{P}_s, \phi_s))$ 
bounded as $s \to \infty$, for some fixed $k \in \mathbb{N}$. Our next result shows that $N_k(G(\mathcal{P}_s, \phi_s))$ 
and $N_k(G(\mathcal{X}_{\lfloor s \rfloor}, \phi_{\lfloor s \rfloor}))$ are approximately Poisson under the condition that $\phi_s \in \Phi_\varepsilon$ 
and $\bar\phi_s \le 1 - \varepsilon$ for all $s$ and some fixed $\varepsilon > 0$; even without this condition, the 
Poisson approximation holds provided that $\bar\phi_s = o(1/\log s)$ (in the case of $G(\mathcal{P}_s, \phi_s)$), 
or provided that $\bar\phi_s = o(s^{-1/2})$ (in the case of $G(\mathcal{X}_{\lfloor s \rfloor}, \phi_{\lfloor s \rfloor})$). 

We also give a result on asymptotic normality, when $\mathbb{E}\, N_k(G(\mathcal{P}_s, \phi_s))$ grows 
slowly as $s \to \infty$. Let $\mathcal{N}$ denote a random variable having the standard normal 
distribution in $\mathbb{R}$, i.e. with probability density function $(2\pi)^{-1/2} \exp(-x^2/2)$, $x \in \mathbb{R}$. 

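Theorem 2.2. Suppose we are given $k \in \mathbb{N}$, $\alpha \in (0, \infty)$ and $\phi_s \in \Phi$ for $s > 0$. 

(a) Suppose $\mathbb{E}\, N_k(G(\mathcal{P}_s, \phi_s)) \to \alpha$ as $s \to \infty$. Then 

(i) if $\bar\phi_s = o(1/\log s)$, then $N_k(G(\mathcal{P}_s, \phi_s)) \xrightarrow{\mathcal{D}} Z_\alpha$ as $s \to \infty$; 

(ii) if $\bar\phi_s = o(s^{-1/2})$ as $s \to \infty$, then $N_k(G(\mathcal{X}_{\lfloor s \rfloor}, \phi_{\lfloor s \rfloor})) \xrightarrow{\mathcal{D}} Z_\alpha$ as $s \to \infty$; 

(iii) if there exists $\varepsilon > 0$ such that $\phi_s \in \Phi_\varepsilon$ and $\bar\phi_s \le 1 - \varepsilon$ for all $s$, then as 
$s \to \infty$ we have 

$$ N_k(G(\mathcal{P}_s, \phi_s)) \xrightarrow{\mathcal{D}} Z_\alpha; \qquad N_k(G(\mathcal{X}_{\lfloor s \rfloor}, \phi_{\lfloor s \rfloor})) \xrightarrow{\mathcal{D}} Z_\alpha. \tag{2.3} $$

(b) Suppose instead as $s \to \infty$ that $\mathbb{E}\, N_k(G(\mathcal{P}_s, \phi_s)) \to \infty$, but $\mathbb{E}\, N_k(G(\mathcal{P}_s, \phi_s)) = o(s)$, 
and also that there exists $\varepsilon > 0$ such that $\phi_s \in \Phi_\varepsilon$ and $\bar\phi_s \le 1 - \varepsilon$ for all $s$. 
Then setting $\alpha_s = \mathbb{E}\, N_k(G(\mathcal{P}_s, \phi_s))$, we have that 
$(N_k(G(\mathcal{P}_s, \phi_s)) - \alpha_s)/\sqrt{\alpha_s} \xrightarrow{\mathcal{D}} \mathcal{N}$. 

Among other things, the case $k = 1$ of Theorem 2.2 (a) (i) generalizes Lemma 
3.2 of [17] to a more general class of $(\mathbb{X}, \phi_s)$ than is considered in [17]. 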

Our next result is concerned with Poisson or normal approximation for (a 
generalization of) the number of edges of $G(\mathcal{P}_s, \phi_s)$. This is of interest in itself, and will 
also be of use in the proof of Theorem 2.1. Given $k \in \mathbb{N}$ with $k \ge 2$, for any graph 
$G$ let $H_k(G)$ denote the number of connected induced subgraphs of $G$ of order $k$ (so 
in particular, $H_2(G)$ is the number of edges). 

Theorem 2.3. Let $\varepsilon > 0$ and $k \in \mathbb{N}$ with $k \ge 2$. Suppose $\phi_s \in \Phi_\varepsilon$ for all $s > 0$. Set 
$G_s = G(\mathcal{P}_s, \phi_s)$. Then: 

(i) As $s \to \infty$, if $\mathbb{E}\, H_k(G_s) \to \alpha \in (0, \infty)$, then $H_k(G_s) \xrightarrow{\mathcal{D}} Z_\alpha$. 

(ii) As $s \to \infty$, if $\mathbb{E}\, H_k(G_s) \to \infty$ but $\mathbb{E}\, H_k(G_s) = o(s)$, then setting 
$\alpha_s := \mathbb{E}\, H_k(G_s)$, we have $(H_k(G_s) - \alpha_s)/\sqrt{\alpha_s} \xrightarrow{\mathcal{D}} \mathcal{N}$. 

The proofs of Theorems 2.1, 2.2 (iii) and 2.3 provide information about the rates 
of convergence in these results. 

We now discuss some of the literature related to these results. Much of this 
concerns the RGG. Suppose $(\mathbb{X}, \mathcal{F}, \mu)$ is the $d$-dimensional Euclidean space with the 
Borel $\sigma$-algebra and with $\mu$ having a density $f$ with respect to Lebesgue measure, 
while the connection function is given by $\phi_s(x,y) = \mathbf{1}\{|x-y| \le r_s\}$, with $r_s$ chosen in 
such a way that $\mathbb{E}\, D_0(G(\mathcal{P}_s, \phi_s)) \to \alpha \in (0, \infty)$, i.e. (by (5.1) below) 

$$ s \int_{\mathbb{R}^d} \exp\left( -s \int_{B_{r_s}(x)} f(y)\,dy \right) f(x)\,dx \to \alpha, \tag{2.4} $$

where $B_r(x) := \{y \in \mathbb{R}^d : |y - x| \le r\}$. 

Gupta and Iyer [5] consider the RGG over densities of the form $f(x) = c_1 \exp(-c_2 |x|^{c_3})$, 
for $r_s$ chosen to satisfy (2.4), and show that $D_0 \xrightarrow{\mathcal{D}} Z_\alpha$ in this case. This generalises 
earlier work in [18] on the case where $c_3 = 2$ (i.e., the standard normal distribution). 

Hsing and Rootzén HQ consider the RGG in $d = 2$ for a general class of densities 
with unbounded support. In their Theorem 1, they give a general set of conditions, 
including (2.4), under which $D_0(G(\mathcal{P}_s, \phi_s)) \xrightarrow{\mathcal{D}} Z_\alpha$. They show that these 
conditions apply, for example, when the density $f$ is bivariate normal with standardized 
marginals, or has independent Weibull distributed coordinates. 

Suppose $(\mathbb{X}, \mathcal{F}, \mu)$ is the $d$-dimensional unit cube equipped with Lebesgue 
measure. Suppose $\phi_s$ is of the form $\phi_s(x,y) = p_s \mathbf{1}\{|x-y| \le r_s\}$, with $p_s$ and $r_s$ chosen so 
that $\mathbb{E}\, D_0(G(\mathcal{P}_s, \phi_s)) \to \alpha$; then $D_0 \xrightarrow{\mathcal{D}} Z_\alpha$. More generally, if $\phi_s$ is of the form 
$\phi_s(x,y) = \psi_s(|x - y|)$ with $\psi_s$ a decreasing function satisfying an exponential decay 
condition, then the Poisson convergence of $D_0$ is known to hold; see na. Our results 
enable us to relax the exponential decay condition and allow for other distributions, 
for example with density bounded away from zero and infinity on a convex compact 
region in $\mathbb{R}^d$. In the special case where $\phi_s(x,y) = \mathbf{1}\{|x-y| \le r_s\}$, several results similar 
to those presented here are given in [16]; see Theorems 3.4 and 8.1 there. 

It is tempting to think that for $\alpha$ fixed and $s$ large, for any choice of connection 
function $\phi$ satisfying $\mathbb{E}\, D_0(G(\mathcal{P}_s, \phi)) = \alpha$, the distribution of $D_0(G(\mathcal{P}_s, \phi))$ would be 
approximately that of $Z_\alpha$. For example, if $\phi(x,y) \in \{0,1\}$ for all $x, y$, then intuition might 
suggest that for large $s$, if $\phi$ is tuned so that $\mathbb{E}\, D_0(G(\mathcal{P}_s, \phi)) = \alpha$, then each isolated 
vertex is a 'rare event', and the information that a particular vertex (at $x$, say) is 
isolated affects only its neighbourhood (i.e. the region $\{y \in \mathbb{X} : \phi(x,y) = 1\}$). 

However, this is not always the case. See the last paragraph of CD Section 
2]. Suppose for example that $(\mathbb{X}, \mathcal{F}, \mu)$ is the unit interval equipped with Lebesgue 
measure, and $\phi_s(x,y)$ is equal to 1 if $\max(x,y) \le s^{-1}$ or $\min(x,y) > s^{-1}$, and 
otherwise $\phi_s(x,y) = 0$. Then the points in $[0, 1/s]$ form one clique and the remaining points 
another, so in the large-$s$ limit the random variable $D_0(G(\mathcal{P}_s, \phi_s))$ 
is Bernoulli distributed with parameter $e^{-1}$, not Poisson distributed. In this case 
the event that there is an isolated vertex in $[0, 1/s]$ is not rare. The condition of 
$\varepsilon$-homogeneity, which appears in many of our results, rules out this sort of example. 

Devroye and Fraiman [6] consider $D_0(G(\mathcal{X}_n, \phi_n))$ on a general space $\mathbb{X}$ in the 
case where $\phi_n = (a(n)\kappa) \wedge 1$ for a fixed function $\kappa : \mathbb{X} \times \mathbb{X} \to \mathbb{R}_+$ (such a function 
$\kappa$ is called a kernel), and some sequence $a(n)$ (where $\wedge$ denotes minimum). This 
is a common assumption for the IRG; see for example [4]. Their results may be 
interpreted as saying that (under certain conditions) the threshold value of $a$ above 
which $G(\mathcal{X}_n, a\kappa \wedge 1)$ is free of isolated vertices (which may be viewed as a random 
variable) satisfies a weak law of large numbers; it is asymptotic to a constant times 
$(\log n)/n$, in probability. They also derive a similar law of large numbers for the 
threshold for the graph to be connected. 

When $\kappa$ is bounded, our Theorem 2.2 (part (a) (ii)) gives a possibility of deriving 
convergence in distribution for this threshold (suitably transformed). For example, 
suppose $\mathbb{X} = [0,1)$ and $\mu$ has a density $f$ with respect to Lebesgue measure, with $f$ 
bounded away from zero. Suppose also that $\kappa(x,y)$ is determined via the absolute 
value of $x - y$ (mod 1) (so we are in the one-dimensional torus). If $f \equiv 1$, or if $f$ is 
smooth with $g(x) := \int \kappa(x,y) f(y)\,dy$ having a unique minimum, then it should be 
possible to derive a distributional limit law for this threshold, since in these cases 
it should be possible, for any $\alpha \in (0, \infty)$, to determine a sequence $a_s$ such that 
$\mathbb{E}\, D_0(G(\mathcal{P}_s, a_s \kappa)) \to \alpha$, and then apply Theorem 2.2 (a) (ii). 

If $\kappa$ is unbounded but shift-invariant on the torus, for example if $\kappa(x, y) = 
((x - y) \bmod 1)^{-\gamma}$ for some fixed positive $\gamma$ and $f \equiv 1$, then for any $\varepsilon \in (0, 1)$, it 
may be possible to find a limiting distribution for a suitable transformation of the 
threshold value of $a$ above which $G(\mathcal{X}_n, a\kappa \wedge (1 - \varepsilon))$ is free of isolated vertices, 
now using Theorem 2.2 (a) (iii). Indeed, in this case the connection functions are all 
1-homogeneous, and by using connection functions $\phi_n = a_n \kappa \wedge (1 - \varepsilon)$ rather than 
the more standard $\phi_n = a_n \kappa \wedge 1$, we ensure that the extra condition $\bar\phi_n \le 1 - \varepsilon$ is 
also satisfied. 

It would be interesting to fully work out and extend these examples; to get 
similar results for the connectivity threshold; and to improve the weak law of [6] to 
a strong law. 


3 A general result on Poisson approximation 

Let $(\mathbb{M}, \mathcal{M}, m)$ be a probability space (known as a mark space). Assume that the 
probability measure $m$ on $\mathbb{M}$ is diffuse, by which we mean that there is a product 
measurable set $A \subset \mathbb{M} \times \mathbb{M}$ with $(m \otimes m)(A) = 0$, such that the diagonal $\{(t,t) : 
t \in \mathbb{M}\}$ is contained in $A$. For example, if $\{t\} \in \mathcal{M}$ and $m(\{t\}) = 0$ for all $t \in \mathbb{M}$, 
then $m$ is diffuse. 

Suppose on a suitable probability space that we have a sequence $((X_i, T_i), i = 
1, 2, 3, \ldots)$ of independent identically distributed random elements of $\mathbb{X} \times \mathbb{M}$ with 
common distribution $\mu \otimes m$, and an independent unit rate Poisson counting process 
$(Z_s, s \ge 0)$, so that the random variable $Z_s$ has Poisson($s$) distribution for each 
$s$, and also a further independent sequence $(\tau, \tau_1, \tau_2, \tau_3, \ldots)$ of independent random 
elements of $\mathbb{M}$ with common distribution $m$. By our assumption that $m$ is diffuse, 
the values of $\tau, T_1, \tau_1, T_2, \tau_2, T_3, \tau_3, \ldots$ are almost surely distinct. 

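A finite point process in $\mathbb{X}$ is defined as a random element of the space $\mathbf{S}(\mathbb{X})$ of 
all finite subsets of $\mathbb{X}$ equipped with the smallest $\sigma$-algebra $\mathcal{S}(\mathbb{X})$ containing the sets 
$\{\xi \in \mathbf{S}(\mathbb{X}) : \xi(B) = k\}$ for all $B \in \mathcal{F}$ and all $k \in \mathbb{N}_0$, where $\xi(B) := |\xi \cap B|$ and $|\xi|$ 
denotes the number of elements of $\xi$. 

A finite point process in $\mathbb{X} \times \mathbb{M}$ is defined similarly as a random element of the 
space $\mathbf{S} := \mathbf{S}(\mathbb{X} \times \mathbb{M})$, where $\mathbb{X} \times \mathbb{M}$ is equipped with the product $\sigma$-algebra $\mathcal{F} \otimes \mathcal{M}$. 
For $k \in \mathbb{N}$, let $\mathbf{S}_k := \{\xi \in \mathbf{S} : |\xi| = k\}$. 

Let $s > 0$, $n \in \mathbb{N}$. Define the following point processes in $\mathbb{X} \times \mathbb{M}$: 

$$ \eta := \bigcup_{i=1}^{Z_s} \{(X_i, T_i)\}; \qquad \xi_n := \bigcup_{i=1}^{n} \{(X_i, T_i)\}. \tag{3.1} $$

Then $\eta$ is a Poisson point process in $\mathbb{X} \times \mathbb{M}$ with mean measure $\lambda \times m$, where we 
set $\lambda = s\mu$. Similarly, $\xi_n$ is a binomial point process in $\mathbb{X} \times \mathbb{M}$. 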

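Let $d_{TV}$ and $d_W$ denote total variation distance and Wasserstein distance, 
respectively, between probability measures on the nonnegative integers. That is, for 
$\mathbb{N}_0$-valued random variables $X, Y$ with distributions $\mathcal{L}(X)$, $\mathcal{L}(Y)$ respectively, we set 

$$ d_{TV}(\mathcal{L}(X), \mathcal{L}(Y)) = \sup_{A \subset \mathbb{N}_0} \left( \mathbb{P}[X \in A] - \mathbb{P}[Y \in A] \right); $$

$$ d_W(\mathcal{L}(X), \mathcal{L}(Y)) = \sup\{ |\mathbb{E}\, h(X) - \mathbb{E}\, h(Y)| : \|\Delta h\|_\infty \le 1 \}, $$

where for $h : \mathbb{N}_0 \to \mathbb{R}$ we set $\Delta h(i) := h(i+1) - h(i)$ for $i \in \mathbb{N}_0$, and 
$\|h\|_\infty := \sup_{i \in \mathbb{N}_0} |h(i)|$. 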
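
The total variation distance defined above is easy to evaluate for explicit distributions, since the supremum over sets is attained at the set of integers where the first probability mass function exceeds the second. The following Python sketch is an illustration only; the function names and the binomial-versus-Poisson example are assumptions made here, not taken from the paper.

```python
import math


def poisson_pmf(a, i):
    """P[Z_a = i] for a Poisson variable with parameter a."""
    return math.exp(-a) * a**i / math.factorial(i)


def binomial_pmf(n, p, i):
    """P[Bin(n, p) = i]."""
    return math.comb(n, i) * p**i * (1 - p) ** (n - i)


def tv_distance(p, q):
    """sup over sets A of (P[X in A] - P[Y in A]), for pmfs given as equal-length lists.

    The supremum is attained at A = {i : p[i] > q[i]}, so it equals the
    sum of the positive parts of p[i] - q[i].
    """
    return sum(max(pi - qi, 0.0) for pi, qi in zip(p, q))


# Hypothetical example: Binomial(100, 0.02) against Poisson(2), truncated at 100
# (the Poisson mass beyond 100 is negligible for this parameter).
n, a = 100, 2.0
p = [binomial_pmf(n, a / n, i) for i in range(n + 1)]
q = [poisson_pmf(a, i) for i in range(n + 1)]
print(tv_distance(p, q))
```

For two genuine probability distributions this one-sided sum agrees with half the sum of absolute differences, which is the other common expression for the same distance.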

The following theorem is related to a well-known result on the Poisson 
approximation of a sum of Bernoulli random variables by Stein's method via coupling 
(Theorem II.24.3 of |43], or Theorem 1.B of [1]). Here the terms in the sum are 
themselves indexed by $k$-subsets of the set of points of a (marked) Poisson point 
process. 

Let $k \in \mathbb{N}$ and let $f : \mathbf{S}_k \times \mathbf{S} \to \{0,1\}$ be a measurable function. For $\xi \in \mathbf{S}$, set 

$$ F(\xi) := \sum_{\psi \subset \xi : |\psi| = k} f(\psi, \xi \setminus \psi). \tag{3.2} $$

We can think of $f$ as a mechanism for selecting some of the $k$-subsets of $\xi$, and $F(\xi)$ 
as the total number of $k$-subsets selected. 
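
The selection mechanism just described, in which a 0/1-valued function is applied to each $k$-subset together with the remaining points and the results are summed, can be sketched in a few lines of Python. This is an illustration under simplifying assumptions: points are plain numbers without marks, and the names `F`, `isolated` and `close_pair` are hypothetical.

```python
from itertools import combinations


def F(xi, f, k):
    """Count the k-subsets psi of xi selected by f, where f(psi, rest) is 0 or 1
    and rest is xi with the points of psi removed."""
    xi = frozenset(xi)
    return sum(f(frozenset(psi), xi - frozenset(psi)) for psi in combinations(xi, k))


def isolated(psi, rest, r=0.2):
    """Select a singleton {x} when no other point lies within distance r of x (k = 1)."""
    return int(all(abs(x - y) > r for x in psi for y in rest))


def close_pair(psi, rest, r=0.2):
    """Select a pair when its two points lie within distance r of each other (k = 2)."""
    return int(max(psi) - min(psi) <= r)


points = {0.0, 0.1, 0.5}
print(F(points, isolated, 1))    # number of isolated points
print(F(points, close_pair, 2))  # number of edges of the threshold graph
```

With $k = 1$ the sum counts vertices with a given property, and with $k = 2$ it counts edges, which matches the two uses of this construction later in the paper.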

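For $x_1, \ldots, x_k \in \mathbb{X}$ set $p(x_1, \ldots, x_k) := \mathbb{E}\, f(\{(x_1, \tau_1), \ldots, (x_k, \tau_k)\}, \eta)$. 

Theorem 3.1. Let $W := F(\eta)$ with $\eta$ and $F$ as described above. Suppose that 
$w : \mathbb{X}^k \to [0, \infty)$ is a measurable function, and that for $\lambda^k$-almost every $x = 
(x_1, \ldots, x_k) \in \mathbb{X}^k$ with $p(x_1, \ldots, x_k) > 0$ we can find coupled random variables $U_x$, $V_x$ 
such that 

• $\mathcal{L}(U_x) = \mathcal{L}(W)$. 

• $\mathcal{L}(1 + V_x) = \mathcal{L}\big( F(\cup_{i=1}^k \{(x_i, \tau_i)\} \cup \eta) \,\big|\, f(\{(x_1, \tau_1), \ldots, (x_k, \tau_k)\}, \eta) = 1 \big)$. 

• $\mathbb{E}[|U_x - V_x|] \le w(x)$. 

Let $\alpha = \mathbb{E}\, W = (1/k!) \int p(x)\, \lambda^k(dx)$. Then 

$$ d_{TV}(\mathcal{L}(W), \mathcal{L}(Z_\alpha)) \le \frac{1 \wedge \alpha^{-1}}{k!} \int w(x) p(x)\, \lambda^k(dx), \tag{3.3} $$

and 

$$ d_W(\mathcal{L}(W), \mathcal{L}(Z_\alpha)) \le \frac{3(1 \wedge \alpha^{-1/2})}{k!} \int w(x) p(x)\, \lambda^k(dx). \tag{3.4} $$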

The proof of this uses the following fact. Suppose $g : \mathbf{S}_k \times \mathbf{S} \to \mathbb{R}$ is a bounded 
measurable function. Then 

$$ \mathbb{E} \sum_{\psi \subset \eta : |\psi| = k} g(\psi, \eta \setminus \psi) = \frac{1}{k!} \int \mathbb{E}\big[ g(\{(x_1, \tau_1), \ldots, (x_k, \tau_k)\}, \eta) \big]\, \lambda^k(dx). \tag{3.5} $$

This is known as the (multivariate) Mecke formula. See e.g. [16, Theorem 1.6]. 
This fact gives us the assertion in the statement of the theorem that $\mathbb{E}\, W = 
(1/k!) \int p(x)\, \lambda^k(dx)$. 

Theorem 3.1 still holds in the case where the measure $\lambda$ is $\sigma$-finite but infinite, so 
that the Poisson point process $\eta$ with mean measure $\lambda \otimes m$ is almost surely infinite. 
The proof is essentially unchanged. 

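Proof of Theorem 3.1. Let $h : \mathbb{N}_0 \to \mathbb{R}$ be bounded. Then, by (3.5), 

$$ \mathbb{E}[W h(W)] = \mathbb{E} \sum_{\psi \subset \eta : |\psi| = k} f(\psi, \eta \setminus \psi)\, h(F(\eta)) $$

$$ = \frac{1}{k!} \int \mathbb{E}\big[ f(\{(x_1, \tau_1), \ldots, (x_k, \tau_k)\}, \eta)\, h\big(F(\cup_{i=1}^k \{(x_i, \tau_i)\} \cup \eta)\big) \big]\, \lambda^k(dx) $$

$$ = \frac{1}{k!} \int \mathbb{E}\big[ h\big(F(\cup_{i=1}^k \{(x_i, \tau_i)\} \cup \eta)\big) \,\big|\, f(\cup_{i=1}^k \{(x_i, \tau_i)\}, \eta) = 1 \big]\, p(x)\, \lambda^k(dx). $$

Also, $\mathbb{E}[\alpha h(W + 1)] = (1/k!) \int \mathbb{E}\, h(W + 1)\, p(x)\, \lambda^k(dx)$, and therefore 

$$ \big| \mathbb{E}[\alpha h(W + 1) - W h(W)] \big| \le \frac{1}{k!} \int p(x)\, \lambda^k(dx) \times \Big| \mathbb{E}\, h(W + 1) - \mathbb{E}\big[ h\big(F(\cup_{i=1}^k \{(x_i, \tau_i)\} \cup \eta)\big) \,\big|\, f(\cup_{i=1}^k \{(x_i, \tau_i)\}, \eta) = 1 \big] \Big|. $$

Since $|h(i) - h(j)| \le \|\Delta h\|_\infty |i - j|$ for $i, j \in \mathbb{N}_0$, we obtain for each $x = (x_1, \ldots, x_k) \in 
\mathbb{X}^k$ that 

$$ \Big| \mathbb{E}\, h(W + 1) - \mathbb{E}\big[ h\big(F(\cup_{i=1}^k \{(x_i, \tau_i)\} \cup \eta)\big) \,\big|\, f(\cup_{i=1}^k \{(x_i, \tau_i)\}, \eta) = 1 \big] \Big| $$

$$ \le |\mathbb{E}\, h(U_x + 1) - \mathbb{E}\, h(V_x + 1)| \le \|\Delta h\|_\infty\, \mathbb{E}|U_x - V_x| \le \|\Delta h\|_\infty\, w(x), $$

and therefore 

$$ \big| \mathbb{E}[\alpha h(W + 1) - W h(W)] \big| \le \frac{\|\Delta h\|_\infty}{k!} \int w(x) p(x)\, \lambda^k(dx). $$

Given $A \subset \mathbb{N}_0$, set $g = \mathbf{1}_A$ and choose $h : \mathbb{N}_0 \to \mathbb{R}$ so that $h(0) = 0$ and 

$$ \alpha h(i + 1) - i h(i) = g(i) - Z_\alpha(g), \quad i \in \mathbb{N}_0, \tag{3.6} $$

where $Z_\alpha(g)$ denotes $\mathbb{E}\, g(Z_\alpha)$ (and $Z_\alpha(A) := \mathbb{P}[Z_\alpha \in A]$). 
Then (see Lemma 1.1.1 of [1]) $h$ is bounded and $\|\Delta h\|_\infty \le 1 \wedge \alpha^{-1}$, and hence 

$$ \big| \mathbb{P}[W \in A] - Z_\alpha(A) \big| \le \frac{1 \wedge \alpha^{-1}}{k!} \int w(x) p(x)\, \lambda^k(dx). $$

The result (3.3) follows. 

One obtains (3.4) similarly by choosing, for any given $g$ with $|g(i) - g(i - 1)| \le 1$ 
for $i \in \mathbb{N}$, a solution $h$ to (3.6) with $h(0) = 0$, and using Lemma 1.1.5 of [1]. □ 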


4 Formal constructions of the IRG 

Let $s \in (0, \infty)$ and $n \in \mathbb{N}$. Let $\phi \in \Phi$. We now give a more formal definition of the 
graphs $G(\mathcal{P}_s, \phi)$ and $G(\mathcal{X}_n, \phi)$. 

We make the following particular choice of mark space $(\mathbb{M}^*, \mathcal{M}^*, m^*)$. Let Leb 
denote Lebesgue measure on $[0,1)$. Let $\mathbb{M}^* = [0,1)^{\mathbb{N}_0}$ with $\mathcal{M}^*$ the product Borel 
$\sigma$-algebra, and with $m^* := \bigotimes_{i=0}^{\infty} \mathrm{Leb}$, so that a random element of $\mathbb{M}^*$ with distribution 
$m^*$ is a sequence of independent uniform$[0,1)$ random variables indexed by $\mathbb{N}_0$. 
Now (and for the rest of this paper) taking $(\mathbb{M}, \mathcal{M}, m) = (\mathbb{M}^*, \mathcal{M}^*, m^*)$, let 
$(X_i, T_i)_{i \ge 1}$, $(Z_s)_{s \ge 0}$ and $(\tau, \tau_1, \tau_2, \ldots)$ be as in the preceding section. Given $n \in \mathbb{N}$ 
and $s > 0$, let the point processes $\eta$ and $\xi_n$ be as given by (3.1). We now write $\eta_s$ for $\eta$ 
to emphasise the dependence on $s$. Thus $\eta_s$ is a Poisson point process in $\mathbb{X}$ with mean 
measure $s\mu$ and with each point marked with a sequence of independent uniform$[0,1)$ 
variables indexed by the nonnegative integers. Similarly, $\xi_n$ is a binomial point 
process in $\mathbb{X} \times \mathbb{M}^*$. We write $\mathbf{S}^*$ for $\mathbf{S}(\mathbb{X} \times \mathbb{M}^*)$. 

To ease notation, we shall also assume from now on that the probability measure 
$\mu$ on $\mathbb{X}$ is diffuse. This ensures that the values of $X_1, X_2, \ldots$ are almost surely 
distinct. However, this assumption is for notational convenience only; even without 
it, one can make sense of our results either by allowing the set $\{X_1, \ldots, X_n\}$ to have 
multiplicities, or by using the attached marks $T_i$ (which are almost surely distinct) 
to distinguish between different points $X_i$. 

Set $\mathcal{X}_n := \{X_1, \ldots, X_n\}$ and $\mathcal{P}_s := \{X_1, \ldots, X_{Z_s}\}$, the canonical projections of 
$\xi_n$ and $\eta_s$ respectively onto $\mathbb{X}$. Define the graph $G(\mathcal{X}_n, \phi)$ to have vertex set $\mathcal{X}_n$ and 
to have an edge between vertices $X_i$ and $X_j$ for $i, j \in [n] := \{1, 2, \ldots, n\}$ with $i < j$, 
if and only if $T_{i,j} < \phi(X_i, X_j)$, where $T_i = (T_{i,0}, T_{i,1}, T_{i,2}, \ldots)$. Let $G(\mathcal{P}_s, \phi)$ be the 
graph $G(\mathcal{X}_{Z_s}, \phi)$. 

This is one way to formally define the random graphs with the properties 
described more informally in the Introduction. It has the advantage that $G(\mathcal{X}_n, \phi)$ is 
a subgraph of $G(\mathcal{P}_s, \phi)$ whenever $n \le Z_s$ and $G(\mathcal{P}_s, \phi)$ is a subgraph of $G(\mathcal{X}_n, \phi)$ 
whenever $Z_s \le n$, which is useful for coupling arguments. However, it has the 
disadvantage that the edge-set of the graph $G(\mathcal{P}_s, \phi)$ is not invariant under permutation 
of the order in which the marked points $(X_1, T_1), \ldots, (X_{Z_s}, T_{Z_s})$ are listed. 
Therefore we define a further graph which has the same distribution but also satisfies this 
permutation-invariance. This will be useful in applying Theorem 3.1 in the proof of 
Theorems 2.1, 2.2 and 2.3. 

Given $\xi \in \mathbf{S}^*$ and $\phi \in \Phi$, define the graph $G_\phi(\xi)$ as follows. If there exist distinct 
$(x, t_0, t_1, \ldots) \in \xi$ and $(y, u_0, u_1, \ldots) \in \xi$ with $t_0 = u_0$, then set $G_\phi(\xi)$ to be the empty 
graph. Otherwise, we can write $\xi$ uniquely as 

$$ \xi = \bigcup_{i=1}^{|\xi|} \{(x_i, t_{i,0}, t_{i,1}, t_{i,2}, \ldots)\} $$

with $t_{1,0} < t_{2,0} < \cdots < t_{|\xi|,0}$. Let $G_\phi(\xi)$ have vertex set $\{x_1, \ldots, x_{|\xi|}\}$ and for each 
$i < j \le |\xi|$ let $G_\phi(\xi)$ have an edge connecting $x_i$ to $x_j$ if and only if $t_{i,j} < \phi(x_i, x_j)$. 
In other words, we use the first coordinate of the marks to determine the order in 
which we enumerate the points of $\xi$; having done so, for $i < j$ we use the $(j+1)$-st 
component of the mark attached to the $i$-th point to decide whether to connect it 
to the $j$-th point. 
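
The marked construction described above (order the points by the first mark coordinate, then use a later coordinate of the earlier point's mark to decide each edge) can be sketched in Python as follows. This is an illustration only: the name `g_phi` is hypothetical, marks are truncated to finite tuples that must have at least one more entry than there are points, and the "empty graph" is represented here, as an assumption, by an empty vertex list and an empty edge set.

```python
def g_phi(xi, phi):
    """Build the graph from marked points.

    xi  : list of (x, marks) pairs, marks = (t_0, t_1, t_2, ...) with entries in [0, 1)
    phi : symmetric connection function with values in [0, 1]
    Returns (vertices in mark order, set of edges as ordered pairs).
    """
    first = [marks[0] for _, marks in xi]
    if len(set(first)) < len(first):
        return [], set()  # tie in the first coordinate: empty graph (assumed representation)
    ordered = sorted(xi, key=lambda point: point[1][0])
    vertices = [x for x, _ in ordered]
    edges = set()
    n = len(ordered)
    for i in range(1, n + 1):          # 1-based indices, as in the text
        marks_i = ordered[i - 1][1]
        for j in range(i + 1, n + 1):
            # coordinate j of the i-th mark (its (j+1)-st component) decides edge {i, j}
            if marks_i[j] < phi(vertices[i - 1], vertices[j - 1]):
                edges.add((vertices[i - 1], vertices[j - 1]))
    return vertices, edges


# Hypothetical example with three labelled points and constant phi = 0.5.
xi = [("a", (0.3, 0.9, 0.1, 0.9)),
      ("b", (0.1, 0.9, 0.2, 0.8)),
      ("c", (0.2, 0.0, 0.0, 0.4))]
print(g_phi(xi, lambda x, y: 0.5))
```

Because the enumeration is determined by the marks themselves, the output does not depend on the order in which the marked points are listed, which is the permutation-invariance the construction is designed to provide.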

Let us say that two random graphs $G$ and $G'$ have the same distribution if any 
graph invariant of $G$ has the same distribution as the same graph invariant evaluated 
on $G'$. The following is immediate from the independence of the components of the 
marks $T_i$. 

Proposition 4.1. For any finite $\mathcal{X} = \{x_1, \ldots, x_m\} \subset \mathbb{X}$ the distribution of the 
random graph $G_\phi(\{(x_1, \tau_1), \ldots, (x_m, \tau_m)\})$ is the same as that of $G(\mathcal{X}, \phi)$. 

In particular, the distribution of $G_\phi(\eta_s)$ is the same as that of $G(\mathcal{P}_s, \phi)$, 
although they are not the same graph because the set of edges is defined differently 
for the two graphs. Likewise $G_\phi(\xi_n)$ has the same distribution as $G(\mathcal{X}_n, \phi)$. 

In the following lemmas we check measurability of functions which will feature 
in the proof of Theorems 2.1 and 2.2 (a) (iii) respectively. We use the following 
notation. For $\ell \in \mathbb{N}$ and $i \in [\ell]$, let $I_{\ell,i}$ denote the set of $(x, t_0, t_1, t_2, \ldots) \in \mathbb{X} \times [0,1)^{\mathbb{N}_0}$ 
such that $(i - 1)/\ell \le t_0 < i/\ell$. 

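Lemma 4.1. Let $j \in \mathbb{N}_0$. Then the function $f : \mathbb{X} \times \mathbb{M}^* \times \mathbf{S}^* \to \{0,1\}$ given by 

$$ f(x, t, \xi) := \mathbf{1}\{x \text{ has degree } j \text{ in } G_\phi(\xi \cup \{(x, t)\})\}, \quad (x, t, \xi) \in \mathbb{X} \times \mathbb{M}^* \times \mathbf{S}^*, $$

is measurable. 

Proof. For $\ell, n, m, i_1, \ldots, i_n \in \mathbb{N}$ with $m \le n \le \ell$, and $1 \le i_1 < i_2 < \cdots < i_n \le \ell$, let 
$A_{\ell,n,m,\mathbf{i}}$ (with $\mathbf{i} = (i_1, \ldots, i_n)$) denote the set of $(x, t, \xi) \in \mathbb{X} \times \mathbb{M}^* \times \mathbf{S}^*$ such that: 

1. $(\xi \cup \{(x,t)\})(I_{\ell,i_k}) = 1$, for $1 \le k \le n$; 

2. $(\xi \cup \{(x,t)\})(I_{\ell,i}) = 0$, for $i \in [\ell] \setminus \{i_1, \ldots, i_n\}$; 

3. $(x, t) \in I_{\ell,i_m}$; 

4. $\sum_{k=1}^{m-1} \xi(\{(y,u) \in I_{\ell,i_k} : u_m < \phi(x,y)\}) + \sum_{k=m+1}^{n} \xi(\{(y,u) \in I_{\ell,i_k} : t_k < 
\phi(x,y)\}) = j$, where $u = (u_0, u_1, \ldots)$ and $t = (t_0, t_1, \ldots)$. 

Then each $A_{\ell,n,m,\mathbf{i}}$ is measurable in $\mathbb{X} \times \mathbb{M}^* \times \mathbf{S}^*$, and 

$$ f(x, t, \xi) = \mathbf{1}\Big\{ (x, t, \xi) \in \bigcup_{\ell, n, m, \mathbf{i}} A_{\ell,n,m,\mathbf{i}} \Big\}, $$

where the (countable) union is over all indices as above, 
which is a measurable function. □ 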

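Lemma 4.2. Let $k \in \mathbb{N}$. Suppose the function $f : (\mathbb{X} \times \mathbb{M}^*)^k \times \mathbf{S}^* \to \{0,1\}$ is given, 
for $(x_1, t_1, \ldots, x_k, t_k) \in (\mathbb{X} \times \mathbb{M}^*)^k$ and $\xi \in \mathbf{S}^*$, by 

$$ f(x_1, t_1, \ldots, x_k, t_k, \xi) := \mathbf{1}\big\{ \{x_1, \ldots, x_k\} \text{ induces a component of } G_\phi(\cup_{i=1}^k \{(x_i, t_i)\} \cup \xi) \big\}. $$

Then $f$ is measurable. 

Proof. For $\ell, n, m_1, \ldots, m_k, i_1, \ldots, i_n \in \mathbb{N}$ with $\max_{1 \le i \le k} m_i \le n \le \ell$, and 
$m_1, \ldots, m_k$ distinct, and $1 \le i_1 < i_2 < \cdots < i_n \le \ell$, let $A_{\ell,n,k,m_1,\ldots,m_k,i_1,\ldots,i_n}$ 
denote the set of $(x_1, t_1, \ldots, x_k, t_k, \xi) \in (\mathbb{X} \times \mathbb{M}^*)^k \times \mathbf{S}^*$ such that for some connected 
graph $\Gamma$ on vertex set $[k]$: 

1. $(\cup_{h=1}^k \{(x_h, t_h)\} \cup \xi)(I_{\ell,i_j}) = 1$, for $1 \le j \le n$; 

2. $(\cup_{h=1}^k \{(x_h, t_h)\} \cup \xi)(I_{\ell,i}) = 0$, for $i \in [\ell] \setminus \{i_1, \ldots, i_n\}$; 

3. $(x_h, t_h) \in I_{\ell,i_{m_h}}$ for $1 \le h \le k$; 

4. for all $h \in [k]$ and $1 \le j < m_h$ with $j \notin \{m_1, \ldots, m_k\}$ we have $\xi(\{(y, u) \in I_{\ell,i_j} : 
u_{m_h} \ge \phi(x_h, y)\}) = 1$, where $u = (u_0, u_1, \ldots)$; 

5. for all $h \in [k]$ and $m_h < j \le n$ with $j \notin \{m_1, \ldots, m_k\}$ we have $\xi(\{(y, u) \in I_{\ell,i_j} : 
t_{h,j} \ge \phi(x_h, y)\}) = 1$, where $t_h = (t_{h,0}, t_{h,1}, \ldots)$; and 

6. for all $h, h' \in [k]$ with $m_h < m_{h'}$ and $\{h, h'\}$ an edge of $\Gamma$, we have $t_{h,m_{h'}} < 
\phi(x_h, x_{h'})$. 

Then each $A_{\ell,n,k,m_1,\ldots,m_k,i_1,\ldots,i_n}$ is measurable in $(\mathbb{X} \times \mathbb{M}^*)^k \times \mathbf{S}^*$, and setting $[n]^k_{\ne}$ to 
be the set of $(m_1, \ldots, m_k) \in [n]^k$ such that $m_1, \ldots, m_k$ are distinct, we have that $f$ 
is the indicator of the set 

$$ \bigcup_{\ell \in \mathbb{N}} \bigcup_{n \le \ell} \bigcup_{(m_1, \ldots, m_k) \in [n]^k_{\ne}} \bigcup_{1 \le i_1 < \cdots < i_n \le \ell} A_{\ell,n,k,m_1,\ldots,m_k,i_1,\ldots,i_n}, $$

which is measurable. □ 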


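5 Proof of Theorem 2.1 

For $s > 0$, let $\eta_s, Z_s, \tau, \tau_1, \tau_2, \ldots$ be as in the preceding section. For $\phi \in \Phi$ and 
$i \in \mathbb{N}_0$, we have by the Mecke equation (3.5) that 

$$ \mathbb{E}\, D_i(G_\phi(\eta_s)) = s \int \frac{\left( s \int \phi(x,y)\,\mu(dy) \right)^i}{i!} \exp\left( -s \int \phi(x,y)\,\mu(dy) \right) \mu(dx). \tag{5.1} $$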
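
Formula (5.1) can be read as $s$ times the $\mu$-average of the probability that a Poisson variable with mean $s \int \phi(x,y)\,\mu(dy)$ equals $i$, since the degree of a point inserted at $x$ is Poisson with that mean. The Python sketch below evaluates this reading numerically; it is an illustration under assumptions made here (state space $[0,1)$ with Lebesgue measure, a midpoint rule, and the hypothetical names `expected_degree_count` and `mean_degree_profile`), not a computation from the paper.

```python
import math


def expected_degree_count(s, i, mean_degree_profile, grid=1000):
    """Midpoint-rule approximation of
        s * integral over [0, 1) of P[Poisson(s * g(x)) = i] dx,
    where g(x) = mean_degree_profile(x) stands for the integral of phi(x, y) over y.
    """
    total = 0.0
    for m in range(grid):
        x = (m + 0.5) / grid
        lam = s * mean_degree_profile(x)
        total += math.exp(-lam) * lam**i / math.factorial(i)
    return s * total / grid


# Hypothetical example on the unit torus: phi(x, y) = p * 1{dist(x, y) <= r},
# so g(x) = 2 * r * p for every x. Choosing r = log(s / alpha) / (2 * p * s)
# makes the expected number of isolated points equal to alpha.
s, p, alpha = 1000.0, 0.5, 1.0
r = math.log(s / alpha) / (2 * p * s)
print(expected_degree_count(s, 0, lambda x: 2 * r * p))
```

In this homogeneous example the mean degree $s \cdot 2rp$ equals $\log(s/\alpha)$, which grows like $\log s$, in line with the scaling derived later in this section.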

In the sequel, other formulae for expectations of numbers of vertices, or $k$-tuples of 
vertices, having certain properties in terms of the graph $G_\phi(\eta_s)$ (or equivalently, the 
graph $G(\mathcal{P}_s, \phi)$), will also be justified by the Mecke formula. These arguments also 
justify (3.5) of [17]. 

Proof of Theorem 2.1. Let $i \in \mathbb{N}_0$. For $s > 0$, set $a_s = \sup_{x \in \mathbb{X}} \int \phi_s(x,y)\,\mu(dy)$. 
Since we assume $\phi_s \in \Phi_\varepsilon$ for all $s > 0$, we have for $s > 0$ and $i \in \mathbb{N}$ that 

$$ \frac{\varepsilon s a_s}{i} \le \frac{\mathbb{E}\, D_i(G(\mathcal{P}_s, \phi_s))}{\mathbb{E}\, D_{i-1}(G(\mathcal{P}_s, \phi_s))} \le \frac{s a_s}{i}. \tag{5.2} $$

Now fix $j \in \mathbb{N}_0$ and suppose $\phi_s$ is such that as $s \to \infty$ we have 

$$ \mathbb{E}\, D_{\le j}(G(\mathcal{P}_s, \phi_s)) \to \alpha \in (0, \infty). \tag{5.3} $$

Suppose that $s a_s$ remains bounded as $s \to \infty$ along some subsequence. Then 
by (5.1), $\mathbb{E}\, D_0(G(\mathcal{P}_s, \phi_s)) \to \infty$ along that subsequence, which contradicts (5.3). 
Therefore $s a_s \to \infty$ as $s \to \infty$. 

By (5.3), $\mathbb{E}\, D_j(G(\mathcal{P}_s, \phi_s))$ remains bounded, and since also $s a_s \to \infty$, if $j \ge 1$ 
then using (5.2) we have that $\mathbb{E}\, D_{j-1}(G(\mathcal{P}_s, \phi_s)) \to 0$, and repeating the argument 
we also have 

$$ \mathbb{E}\, D_i(G(\mathcal{P}_s, \phi_s)) \to 0, \quad i = 0, 1, \ldots, j - 1; \tag{5.4} $$

$$ \mathbb{E}\, D_j(G(\mathcal{P}_s, \phi_s)) \to \alpha. \tag{5.5} $$

The first part of (2.2) follows from (5.4). 

For any two $(0, \infty)$-valued functions $u(s)$ and $v(s)$, we say $u(s) = \Theta(v(s))$ if 
$\liminf_{s \to \infty} u(s)/v(s) > 0$ and $\limsup_{s \to \infty} u(s)/v(s) < \infty$. We show next that 

$$ s a_s = \Theta(\log s). \tag{5.6} $$

We already know that $s a_s \to \infty$. Since for any $i \in \mathbb{N}$ the function $x^i e^{-x}$ is decreasing 
in $x$ for large enough $x$, for large enough $s$ we have that if $s a_s \le (\log s)/2$ then by 
(5.1), 

$$ \mathbb{E}\, D_j(G(\mathcal{P}_s, \phi_s)) \ge (s/j!)\, e^{-(\log s)/2}\, ((\log s)/2)^j, $$

which tends to infinity, contradicting (5.3). On the other hand, since $\phi_s \in \Phi_\varepsilon$, if 
$s a_s \ge (2/\varepsilon) \log s$ then for large enough $s$, by (5.1) we have 

$$ \mathbb{E}\, D_j(G(\mathcal{P}_s, \phi_s)) \le s\, e^{-2 \log s}\, ((2/\varepsilon) \log s)^j, $$

which tends to zero, contradicting (5.5). Thus we have (5.6). 

Next we show that $D_j(G(\mathcal{P}_s, \phi_s)) \xrightarrow{\mathcal{D}} Z_\alpha$. We aim to apply Theorem 3.1 to 
the case $\phi = \phi_s$ of the function $f$ considered in Lemma 4.1. That is, for $(x, t, \xi) \in 
\mathbb{X} \times \mathbb{M}^* \times \mathbf{S}^*$ we set $f_s(x, t, \xi)$ to be the indicator of the statement that $x$ has degree 
$j$ in $G_{\phi_s}(\xi \cup \{(x, t)\})$. Then $D_j(G_{\phi_s}(\eta_s)) = F_s(\eta_s)$, where $F_s$ is the function $F$ 
obtained by using $f = f_s$ in (3.2) (with $k = 1$). 

Let $s > 0$ and $x \in \mathbb{X}$. If $j \ge 1$ suppose we also have an extra sequence 
$(Y, Y_1, \ldots, Y_j)$ of independent identically distributed random elements of $\mathbb{X}$ with 
$\mathbb{P}[Y \in dy] = \phi_s(x,y)\,\mu(dy) / \int \phi_s(x,z)\,\mu(dz)$, independent of $((X_i, T_i))_{i \ge 1}$ and $(Z_s)_{s \ge 0}$ 
and $(\tau, \tau_1, \tau_2, \ldots)$. Let $\mathcal{G}_s$ be the graph $G_{\phi_s}(\eta_s \cup \{(x, \tau), (Y_1, \tau_1), \ldots, (Y_j, \tau_j)\})$, with 
added edges from $x$ to each of $Y_1, \ldots, Y_j$ (if not already included). Let $\mathcal{P}_{s,x}$ be the 
set of points of $\mathcal{P}_s$ that are connected to $x$ in this graph, and set $\mathcal{P}^x_s := \mathcal{P}_s \setminus \mathcal{P}_{s,x}$. 

Let $U_x$ denote the number of vertices of degree $j$ in the subgraph of $\mathcal{G}_s$ induced 
by vertex set $\mathcal{P}_s$. By Proposition 4.1 this graph has the distribution of $G(\mathcal{P}_s, \phi_s)$, 
so $U_x$ has the distribution of $F_s(\eta_s)$. 

Now consider the subgraph of $\mathcal{G}_s$ induced by $\{x\} \cup \mathcal{P}_s$. This has the distribution 
of $G(\mathcal{P}_s \cup \{x\}, \phi_s)$, and $\mathcal{P}_{s,x}$ is the set of vertices in this graph lying adjacent to $x$. 
Conditioning on $x$ having degree $j$ amounts to conditioning on $|\mathcal{P}_{s,x}| = j$. We define 
a coupled point process $\mathcal{P}^*_s$ (a subset of $\mathcal{P}_s \cup \{Y_1, \ldots, Y_j\}$) with the distribution of 
$\mathcal{P}_s$ conditioned on $x$ having degree $j$, as follows. 

If $|\mathcal{P}_{s,x}| \ge j$ then we select $|\mathcal{P}_{s,x}| - j$ elements of $\mathcal{P}_{s,x}$ uniformly at random and 
discard them from $\mathcal{P}_s$ to get a point process $\mathcal{P}^*_s$. If $|\mathcal{P}_{s,x}| < j$ we set $\mathcal{P}^*_s := \mathcal{P}_s \cup \mathcal{Y}^*$, 
where we set 

$$ \mathcal{Y}^* := \{Y_1, \ldots, Y_{j - |\mathcal{P}_{s,x}|}\}. $$

Let $\mathcal{G}^*_s$ denote the subgraph of $\mathcal{G}_s$ induced by $\mathcal{P}^*_s \cup \{x\}$. Then $x$ has degree $j$ in 
$\mathcal{G}^*_s$. Let $V_x$ be the number of vertices in $\mathcal{G}^*_s$ having degree $j$, other than $x$. This has 
the conditional distribution of $F_s(\{(x, \tau)\} \cup \eta_s) - 1$ given that $f_s(\{(x, \tau)\}, \eta_s) = 1$. 
This is because $\mathcal{P}_{s,x}$ and $\mathcal{P}^x_s$ are independent Poisson processes, and conditioning 
on $f_s(\{(x, \tau)\}, \eta_s) = 1$ amounts to conditioning on the first of these two Poisson 
processes having $j$ points. 

If \V a ,x\ > j then \U X — V x \ < U' x + Vj!, where we set U' x to be the number of 
y G V StX such that y has j neighbours in V s , and V' x to be the number of pairs (y, z) 
with y G V s ,x, z £ V s , such that z ^ y, z is connected to y and z has at most j 
neighbours in P®. Then by the Mecke formula, and the assumption that <p s G $£ 
for all s, 

EU' X = J S(p s (x,y) ^ exp J 5 0 s (|/,z) / i(dz)^ n(dy) 

= 0((sa s y +1 )exp(-Q(sa s )) = o(l), (5.7) 





uniformly over $x \in \mathbb{X}$. Also, using that $\phi_s \leq 1 - \varepsilon$ for all $s$, we have that

$$E V'_x = \sum_{i=0}^{j} \int s\phi_s(x,y) \int s\phi_s(y,z)\, \frac{\big(s\int \phi_s(z,w)(1-\phi_s(x,w))\mu(dw)\big)^i}{i!}
\exp\Big(-s\int \phi_s(z,w)(1-\phi_s(x,w))\mu(dw)\Big)\mu(dz)\mu(dy)
= O((sa_s)^{j+2}) \times \exp(-\Theta(sa_s)) \to 0, \tag{5.8}$$

uniformly over $x \in \mathbb{X}$.

If $|\mathcal{P}_{s,x}| < j$ then $|U_x - V_x| \leq U''_x + V''_x$, where we set $U''_x$ to be the number of
$y \in \mathcal{Y}$ having at most $j$ neighbours in $\mathcal{P}_s$, and $V''_x$ is the number of pairs $(y,z)$ with
$y \in \mathcal{Y}$, $z \in \mathcal{P}_s$ connected to $y$, and $z$ having at most $j$ neighbours in $\mathcal{P}_s$. Then

$$E U''_x \leq j \sum_{i=0}^{j} \int (sa_s)^i \exp(-\Theta(sa_s))\, \frac{\phi_s(x,y)}{\int \phi_s(x,w)\mu(dw)}\, \mu(dy) \to 0, \tag{5.9}$$


uniformly over $x \in \mathbb{X}$. Also

$$E V''_x \leq j \sum_{i=0}^{j} \int\!\!\int \frac{\big(s\int \phi_s(z,w)\mu(dw)\big)^i}{i!} \exp\Big(-s\int \phi_s(z,w)\mu(dw)\Big)
\frac{\phi_s(x,y)\, s\phi_s(z,y)}{\int \phi_s(x,w)\mu(dw)}\, \mu(dy)\mu(dz)
= O((sa_s)^{j+1}) \times \exp(-\Theta(sa_s)) \to 0, \tag{5.10}$$

uniformly over $x \in \mathbb{X}$.

Combining the estimates (5.7), (5.8), (5.9) and (5.10), and using Theorem 3.1
gives us the first part of (2.1), namely $D_j(G(\mathcal{P}_s,\phi_s)) \xrightarrow{\mathcal{D}} Z_\alpha$. It remains to prove
the second part of (2.1), and of (2.2).

For $n \in \mathbb{N}$, consider $G(\mathcal{X}_n,\phi_n)$. Let $s(n) := n - n^{3/4}$ and $t(n) := n + n^{3/4}$.
By Chebyshev's inequality, with high probability (i.e. with probability tending to
1) we have $Z_{s(n)} \leq n \leq Z_{t(n)}$ so that $\mathcal{P}_{s(n)} \subset \mathcal{X}_n \subset \mathcal{P}_{t(n)}$. Moreover, when this
happens, $G(\mathcal{X}_n,\phi_n)$ is the subgraph of $G(\mathcal{P}_{t(n)},\phi_n)$ induced by $\mathcal{X}_n$, and $G(\mathcal{P}_{s(n)},\phi_n)$ is
the subgraph of $G(\mathcal{X}_n,\phi_n)$ induced by $\mathcal{P}_{s(n)}$.
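The Chebyshev step above can be made quantitative with a small sketch. It assumes that $Z_s$ denotes the number of points of the Poisson process with intensity $s$, so that $Z_s$ is Poisson with mean and variance $s$; the function name is hypothetical and only illustrates the bound.

```python
def sandwich_failure_bound(n: int) -> float:
    """Chebyshev upper bound on P[not (Z_{s(n)} <= n <= Z_{t(n)})].

    Assumes Z_s ~ Poisson(s), with s(n) = n - n^{3/4} and t(n) = n + n^{3/4}.
    """
    gap = n ** 0.75
    s_n = n - gap
    t_n = n + gap
    # P[Z_s > n] <= P[|Z_s - s| >= gap] <= Var(Z_s) / gap^2 = s / gap^2
    upper_tail = s_n / gap ** 2
    # P[Z_t < n] <= P[|Z_t - t| >= gap] <= Var(Z_t) / gap^2 = t / gap^2
    lower_tail = t_n / gap ** 2
    return min(1.0, upper_tail + lower_tail)


if __name__ == "__main__":
    for n in (10**2, 10**4, 10**6):
        print(n, sandwich_failure_bound(n))
```

The two tails add up to $2n/n^{3/2} = 2n^{-1/2}$, which is why the sandwich holds with probability tending to 1.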

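By (5.1), for $i \in \mathbb{N}_0$,

$$\Big(\frac{s(n)}{n}\Big)^{i+1} \leq \frac{E D_i(G(\mathcal{P}_{s(n)},\phi_n))}{E D_i(G(\mathcal{P}_n,\phi_n))}
\leq \Big(\frac{s(n)}{n}\Big)^{i+1} \sup_{x \in \mathbb{X}} \exp\Big(n^{3/4}\int \phi_n(x,y)\mu(dy)\Big),$$

and by (5.6), both the upper and the lower bound tend to 1. Therefore by (5.4) and
(5.5) we have as $n \to \infty$ that

$$E D_i(G(\mathcal{P}_{s(n)},\phi_n)) \to 0, \quad i = 0,1,\ldots,j-1; \tag{5.11}$$

$$E D_j(G(\mathcal{P}_{s(n)},\phi_n)) \to \alpha. \tag{5.12}$$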
Let $k \in \mathbb{N}_0$ with $k \leq j$. If $\mathcal{P}_{s(n)} \subset \mathcal{X}_n$, then

$$D_{\leq k}(G(\mathcal{X}_n,\phi_n)) - D_{\leq k}(G(\mathcal{P}_{s(n)},\phi_n)) = S_n - R_n,$$

where $S_n$ denotes the number of points of $\mathcal{X}_n \setminus \mathcal{P}_{s(n)}$ with degree at most $k$ in
$G(\mathcal{X}_n,\phi_n)$, and $R_n$ is the number of points of $\mathcal{P}_{s(n)}$ with degree at most $k$ in
$G(\mathcal{P}_{s(n)},\phi_n)$ but with degree at least $k+1$ in $G(\mathcal{X}_n,\phi_n)$.

Let $S'_n$ denote the number of points of $\mathcal{P}_{t(n)} \setminus \mathcal{P}_{s(n)}$ that are connected to at most
$k$ points of $\mathcal{P}_{s(n)}$, and let $R'_n$ be the number of points of $\mathcal{P}_{s(n)}$ with degree at most $k$
in $G(\mathcal{P}_{s(n)},\phi_n)$ but with degree at least $k+1$ in $G(\mathcal{P}_{t(n)},\phi_n)$. If $\mathcal{P}_{s(n)} \subset \mathcal{X}_n \subset \mathcal{P}_{t(n)}$,
then $S_n \leq S'_n$ and $R_n \leq R'_n$. Hence by Markov's inequality,

$$P[D_{\leq k}(G(\mathcal{X}_n,\phi_n)) \neq D_{\leq k}(G(\mathcal{P}_{s(n)},\phi_n))] \leq P[\{Z_{s(n)} \leq n \leq Z_{t(n)}\}^c] + E S'_n + E R'_n.$$
Now

$$E S'_n = 2n^{3/4} \sum_{i=0}^{k} \int \frac{\big(s(n)\int \phi_n(x,y)\mu(dy)\big)^i}{i!} \exp\Big(-s(n)\int \phi_n(x,y)\mu(dy)\Big)\mu(dx)
= \Big(\frac{2n^{3/4}}{s(n)}\Big) E D_{\leq k}(G(\mathcal{P}_{s(n)},\phi_n)),$$

which tends to zero by (5.11) and (5.12). Also

$$E R'_n \leq s(n) \sum_{i=0}^{k} \int \frac{\big(s(n)\int \phi_n(x,y)\mu(dy)\big)^i}{i!} \exp\Big(-s(n)\int \phi_n(x,y)\mu(dy)\Big)
\Big(2n^{3/4}\int \phi_n(x,y)\mu(dy)\Big)\mu(dx)
\leq 2n^{3/4} a_n\, E D_{\leq k}(G(\mathcal{P}_{s(n)},\phi_n)),$$

which tends to zero by (5.6), (5.11) and (5.12).

Therefore with high probability we have $D_{\leq k}(G(\mathcal{X}_n,\phi_n)) = D_{\leq k}(G(\mathcal{P}_{s(n)},\phi_n))$.
This holds both for $k = j$, and for $k = j - 1$. Hence using the first part of (2.1) and
the first part of (2.2) we obtain the second part of (2.1) and of (2.2). □


6 Proof of Theorem 2.2


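Given $\phi \in \Phi$, given $k, \ell \in \mathbb{N}$, and given $\mathbf{x} = (x_1,\ldots,x_k) \in \mathbb{X}^k$ and $\mathbf{y} = (y_1,\ldots,y_\ell) \in \mathbb{X}^\ell$, set

$$\phi(\mathbf{x},\mathbf{y}) := 1 - \prod_{i=1}^{k}\prod_{j=1}^{\ell} (1 - \phi(x_i,y_j)). \tag{6.1}$$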
We also write $\phi(\{x_1,\ldots,x_k\},\{y_1,\ldots,y_\ell\})$ for $\phi(\mathbf{x},\mathbf{y})$ (allowing multiplicities in the
sets $\{x_1,\ldots,x_k\}$ and $\{y_1,\ldots,y_\ell\}$); it is the probability that there is at least one edge
in the random graph $G(\{x_1,\ldots,x_k,y_1,\ldots,y_\ell\},\phi)$ connecting one of the vertices $x_i$
to one of the vertices $y_j$. If $k = 1$ we write $\phi(x_1,\{y_1,\ldots,y_\ell\})$ for $\phi(\{x_1\},\{y_1,\ldots,y_\ell\})$.
Also, let $h_\phi(\mathbf{x})$ or $h_\phi(x_1,\ldots,x_k)$ denote the probability that $G(\{x_1,\ldots,x_k\},\phi)$ is
connected, or more precisely

$$h_\phi(x_1,\ldots,x_k) = \sum_{\Gamma} \prod_{\{i,j\} \in E(\Gamma)} \phi(x_i,x_j) \prod_{\{i,j\} \notin E(\Gamma)} (1 - \phi(x_i,x_j)), \tag{6.2}$$

where the sum is over all connected graphs $\Gamma$ on vertex set $\{1,\ldots,k\}$, and $E(G)$
denotes the set of edges of a graph $G$. By the Mecke formula (3.5), and the equality
in distribution of $G(\mathcal{P}_s,\phi)$ and $G_\phi(\eta_s)$ as discussed in Section 4,

$$E N_k(G(\mathcal{P}_s,\phi)) = E N_k(G_\phi(\eta_s))
= \frac{s^k}{k!} \int h_\phi(x_1,\ldots,x_k) \exp\Big(-s\int \phi(z,\{x_1,\ldots,x_k\})\mu(dz)\Big)\mu^k(d(x_1,\ldots,x_k)). \tag{6.3}$$
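For small $k$ the quantities in (6.1) and (6.2) can be evaluated by brute force. The following is only an illustrative sketch: the function names are hypothetical, the connection function is passed in as an arbitrary Python callable, and the enumeration over all graphs on $k$ vertices is exponential in $k(k-1)/2$.

```python
from itertools import combinations, product


def phi_sets(phi, xs, ys):
    """(6.1): probability of at least one edge between the points xs and ys."""
    prob_no_edge = 1.0
    for x in xs:
        for y in ys:
            prob_no_edge *= 1.0 - phi(x, y)
    return 1.0 - prob_no_edge


def is_connected(k, edges):
    """Depth-first search on the vertex set {0, ..., k-1}."""
    adj = {v: set() for v in range(k)}
    for i, j in edges:
        adj[i].add(j)
        adj[j].add(i)
    seen, stack = {0}, [0]
    while stack:
        v = stack.pop()
        for w in adj[v] - seen:
            seen.add(w)
            stack.append(w)
    return len(seen) == k


def h_phi(phi, xs):
    """(6.2): probability that the random graph on the points xs is connected."""
    k = len(xs)
    pairs = list(combinations(range(k), 2))
    total = 0.0
    # Sum the probability of every edge configuration that is connected.
    for present in product([False, True], repeat=len(pairs)):
        edges = [pair for pair, on in zip(pairs, present) if on]
        if not is_connected(k, edges):
            continue
        weight = 1.0
        for (i, j), on in zip(pairs, present):
            p = phi(xs[i], xs[j])
            weight *= p if on else 1.0 - p
        total += weight
    return total
```

With a constant connection function $\phi \equiv p$ and three points, the sum in (6.2) has four terms (three paths and the triangle), giving $3p^2(1-p) + p^3$.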

Now fix $k \in \mathbb{N}$ and $\alpha \in (0,\infty)$. Assume throughout this section that $\phi_s \in \Phi$ for
$s > 0$, and (unless explicitly stated otherwise) that

$$\lim_{s \to \infty} E N_k(G(\mathcal{P}_s,\phi_s)) = \alpha \in (0,\infty). \tag{6.4}$$



Proof of Theorem 2.2 (a) (i). Assume that $\bar\phi_s = o(1/\log s)$. We shall use the
method of moments.

For $n, \ell \in \mathbb{N}$ we write $(n)_\ell$ for the descending factorial $n(n-1)\cdots(n-\ell+1)$.
Then $(N_k(G(\mathcal{P}_s,\phi_s)))_\ell$ is the number of ordered $\ell$-tuples of distinct $k$-components
of $G(\mathcal{P}_s,\phi_s)$. This equals the sum over all $k\ell$-tuples $x_{1,1},\ldots,x_{1,k},\ldots,x_{\ell,1},\ldots,x_{\ell,k}$ of
distinct points of $\mathcal{P}_s$, of the indicator of the event that for each $i \leq \ell$ the subgraph of
$G(\mathcal{P}_s,\phi_s)$ induced by $x_{i,1},\ldots,x_{i,k}$ is connected and these vertices are not connected
to any other vertices of $G(\mathcal{P}_s,\phi_s)$, divided by $(k!)^\ell$. Hence by the Mecke formula,


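$$E[(N_k(G(\mathcal{P}_s,\phi_s)))_\ell] = \frac{s^{k\ell}}{(k!)^\ell} \int \Big(\prod_{i=1}^{\ell} h_{\phi_s}(x_{i,1},\ldots,x_{i,k})\Big) u_s(x_{1,1},\ldots,x_{\ell,k})
\exp\Big(-s\int \phi_s(z,\{x_{1,1},\ldots,x_{\ell,k}\})\mu(dz)\Big)\mu^{k\ell}(d(x_{1,1},\ldots,x_{\ell,k})), \tag{6.5}$$

where we set $u_s(x_{1,1},\ldots,x_{\ell,k})$ to be the probability that the graph $G(\{x_{1,1},\ldots,x_{\ell,k}\},\phi_s)$
has no edge between any $x_{i,j}$ and $x_{i',j'}$ such that $i \neq i'$, that is,

$$u_s(x_{1,1},\ldots,x_{\ell,k}) := \prod_{(i_1,j_1),(i_2,j_2) \in [\ell]\times[k] :\, i_1 < i_2} (1 - \phi_s(x_{i_1,j_1},x_{i_2,j_2})). \tag{6.6}$$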
By our condition on $\bar\phi_s$ the value of $u_s(x_{1,1},\ldots,x_{\ell,k})$ tends to 1, uniformly over
$(x_{1,1},\ldots,x_{\ell,k})$. Also, by the union bound

$$\phi_s(z,\{x_{1,1},\ldots,x_{\ell,k}\}) \leq \sum_{i=1}^{\ell} \phi_s(z,\{x_{i,1},\ldots,x_{i,k}\}).$$

Therefore by (6.5), writing just $N_k$ for $N_k(G(\mathcal{P}_s,\phi_s))$, we have

$$E[(N_k)_\ell] \geq (1+o(1)) \Big(\frac{s^k}{k!} \int h_{\phi_s}(x_1,\ldots,x_k)
\exp\Big(-s\int \phi_s(z,\{x_1,\ldots,x_k\})\mu(dz)\Big)\mu^k(d(x_1,\ldots,x_k))\Big)^{\ell}.$$

By (6.4) and (6.3), this lower bound for $E[(N_k)_\ell]$ tends to $\alpha^\ell$ as $s \to \infty$.

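By the Bonferroni bound and the union bound we have for $(z,x_{1,1},\ldots,x_{\ell,k}) \in \mathbb{X}^{1+\ell k}$ that

$$\phi_s(z,\{x_{1,1},\ldots,x_{\ell,k}\}) \geq \sum_{i=1}^{\ell} \phi_s(z,\{x_{i,1},\ldots,x_{i,k}\}) \Big(1 - \sum_{j=i+1}^{\ell} \phi_s(z,\{x_{j,1},\ldots,x_{j,k}\})\Big)
\geq \sum_{i=1}^{\ell} \phi_s(z,\{x_{i,1},\ldots,x_{i,k}\})(1 - k\ell\bar\phi_s). \tag{6.7}$$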


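Therefore by (6.5),

$$E[(N_k)_\ell] \leq \frac{s^{k\ell}}{(k!)^\ell} \int \Big(\prod_{i=1}^{\ell} h_{\phi_s}(x_{i,1},\ldots,x_{i,k})\Big)
\exp\Big(-s\int \sum_{i=1}^{\ell} \phi_s(z,\{x_{i,1},\ldots,x_{i,k}\})(1 - k\ell\bar\phi_s)\mu(dz)\Big)\mu^{k\ell}(d(x_{1,1},\ldots,x_{\ell,k}))$$

$$= \Big(\frac{s^k}{k!} \int h_{\phi_s}(x_1,\ldots,x_k)
\exp\Big(-s\int \phi_s(z,\{x_1,\ldots,x_k\})(1 - k\ell\bar\phi_s)\mu(dz)\Big)\mu^k(d(x_1,\ldots,x_k))\Big)^{\ell}.$$

Since $\mu$ is a probability measure and $x^{1-k\ell\bar\phi_s}$ is a concave function on $x \geq 0$, we
obtain by Jensen's inequality (using also that $0 \leq h_{\phi_s} \leq 1$) that

$$E[(N_k)_\ell] \leq \Big(\frac{s^k}{k!}\Big)^{\ell} \Big(\int h_{\phi_s}(x_1,\ldots,x_k)
\exp\Big(-s\int \phi_s(z,\{x_1,\ldots,x_k\})\mu(dz)\Big)\mu^k(d(x_1,\ldots,x_k))\Big)^{\ell(1-k\ell\bar\phi_s)}$$

$$\leq \frac{s^{k^2\ell^2\bar\phi_s}}{(k!)^{\ell^2 k\bar\phi_s}} \Big(\frac{s^k}{k!} \int h_{\phi_s}(x_1,\ldots,x_k)
\exp\Big(-s\int \phi_s(z,\{x_1,\ldots,x_k\})\mu(dz)\Big)\mu^k(d(x_1,\ldots,x_k))\Big)^{\ell(1-k\ell\bar\phi_s)},$$

which tends to $\alpha^\ell$ by (6.3), (6.4) and our assumption on $\bar\phi_s$. Thus $E[(N_k)_\ell] \to \alpha^\ell$,
so by the method of moments (see e.g. Theorem 1.22 of [3]) the result (i) follows. □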
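The method of moments is applied here through factorial moments: a Poisson variable $Z_\alpha$ with mean $\alpha$ satisfies $E[(Z_\alpha)_\ell] = \alpha^\ell$ for every $\ell$, which is the limit identified above. A minimal numerical check of this standard fact, with hypothetical helper names and a truncated series, might look as follows.

```python
import math


def descending_factorial(n: int, l: int) -> int:
    """(n)_l = n (n-1) ... (n-l+1), with (n)_0 = 1."""
    result = 1
    for i in range(l):
        result *= n - i
    return result


def poisson_factorial_moment(alpha: float, l: int, terms: int = 200) -> float:
    """Truncated series for E[(Z)_l] when Z ~ Poisson(alpha)."""
    total = 0.0
    pmf = math.exp(-alpha)  # P[Z = 0]
    for n in range(terms):
        total += descending_factorial(n, l) * pmf
        pmf *= alpha / (n + 1)  # P[Z = n + 1] from P[Z = n]
    return total


if __name__ == "__main__":
    alpha = 2.5
    for l in range(1, 5):
        print(l, poisson_factorial_moment(alpha, l), alpha ** l)
```

The truncation at 200 terms is an arbitrary choice that is ample for moderate $\alpha$; it would need to grow with $\alpha$.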


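Proof of Theorem 2.2 (a) (ii). Assume now that $\bar\phi_s = o(s^{-1/2})$. Again we use
the method of moments. Write $n$ for $\lfloor s \rfloor$, and set $N'_k = N_k(G(\mathcal{X}_n,\phi_n))$. Then by
(6.6) and the union bound,

$$E[(N'_k)_\ell] = \frac{(n)_{k\ell}}{(k!)^\ell} \int\cdots\int \Big(\prod_{i=1}^{\ell} h_{\phi_n}(x_{i,1},\ldots,x_{i,k})\Big) u_n(x_{1,1},\ldots,x_{\ell,k})
\Big(1 - \int \phi_n(z;\{x_{1,1},\ldots,x_{\ell,k}\})\mu(dz)\Big)^{n-k\ell} \mu(dx_{1,1})\cdots\mu(dx_{\ell,k})$$

$$\geq (1+o(1)) \Big(\frac{n^k}{k!}\Big)^{\ell} \int\cdots\int \Big(\prod_{i=1}^{\ell} h_{\phi_n}(x_{i,1},\ldots,x_{i,k})\Big)
\Big(1 - \sum_{i=1}^{\ell} \int \phi_n(z;\{x_{i,1},\ldots,x_{i,k}\})\mu(dz)\Big)^{n} \mu(dx_{1,1})\cdots\mu(dx_{\ell,k}).$$

Using the bound $1 - x \geq \exp(-x - x^2)$ for small positive $x$, we have for large $n$ that

$$\Big(1 - \sum_{i=1}^{\ell} \int \phi_n(z;\{x_{i,1},\ldots,x_{i,k}\})\mu(dz)\Big)^{n}
\geq \exp\Big(-n\sum_{i=1}^{\ell} \int \phi_n(z;\{x_{i,1},\ldots,x_{i,k}\})\mu(dz) - nk^2\ell^2\bar\phi_n^2\Big),$$

so that

$$E[(N'_k)_\ell] \geq (1+o(1)) \Big(\frac{n^k}{k!} \int h_{\phi_n}(x_1,\ldots,x_k)
\exp\Big(-n\int \phi_n(z,\{x_1,\ldots,x_k\})\mu(dz)\Big)\mu^k(d(x_1,\ldots,x_k))\Big)^{\ell} \times \exp(-nk^2\ell^2\bar\phi_n^2),$$

which tends to $\alpha^\ell$ by (6.3), (6.4), and the assumption that $\bar\phi_n = o(n^{-1/2})$.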
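Conversely, by the bound $e^x \geq 1 + x$ for all $x \in \mathbb{R}$, we also have

$$E[(N'_k)_\ell] \leq \Big(\frac{n^k}{k!}\Big)^{\ell} \int\cdots\int \Big(\prod_{i=1}^{\ell} h_{\phi_n}(x_{i,1},\ldots,x_{i,k})\Big)
\exp\Big(-(n - \ell k)\int \phi_n(z;\{x_{1,1},\ldots,x_{\ell,k}\})\mu(dz)\Big)\mu(dx_{1,1})\cdots\mu(dx_{\ell,k}),$$

and using the Bonferroni bound as in (6.7), we obtain that

$$E[(N'_k)_\ell] \leq e^{k^2\ell^2\bar\phi_n} \Big(\frac{n^k}{k!}\Big)^{\ell} \int\cdots\int \Big(\prod_{i=1}^{\ell} h_{\phi_n}(x_{i,1},\ldots,x_{i,k})\Big)
\exp\Big(-n\sum_{i=1}^{\ell} \int \phi_n(z;\{x_{i,1},\ldots,x_{i,k}\})(1 - k\ell\bar\phi_n)\mu(dz)\Big)\mu(dx_{1,1})\cdots\mu(dx_{\ell,k}).$$

By Jensen's inequality, since $x^{1-k\ell\bar\phi_n}$ is a concave function on $x \geq 0$, we have

$$E[(N'_k)_\ell] \leq e^{k^2\ell^2\bar\phi_n} \Big(\frac{n^k}{k!}\Big)^{\ell} \Big(\int\cdots\int \Big(\prod_{i=1}^{\ell} h_{\phi_n}(x_{i,1},\ldots,x_{i,k})\Big)^{1/(1-k\ell\bar\phi_n)}
\exp\Big(-n\sum_{i=1}^{\ell} \int \phi_n(z;\{x_{i,1},\ldots,x_{i,k}\})\mu(dz)\Big)\mu(dx_{1,1})\cdots\mu(dx_{\ell,k})\Big)^{1-k\ell\bar\phi_n}$$

$$\leq e^{k^2\ell^2\bar\phi_n}\, \frac{n^{k^2\ell^2\bar\phi_n}}{(k!)^{k\ell^2\bar\phi_n}} \Big(\frac{n^k}{k!} \int\cdots\int h_{\phi_n}(x_1,\ldots,x_k)
\exp\Big(-n\int \phi_n(z;\{x_1,\ldots,x_k\})\mu(dz)\Big)\mu(dx_1)\cdots\mu(dx_k)\Big)^{\ell(1-k\ell\bar\phi_n)}.$$

Hence, since our assumption $\bar\phi_n = o(n^{-1/2})$ implies that $\bar\phi_n = o(1/\log n)$, we have
$\limsup E[(N'_k)_\ell] \leq \alpha^\ell$. Therefore $E[(N'_k)_\ell] \to \alpha^\ell$, and the method of moments gives
us part (ii). □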
Proof of Theorem 2.2 (a) (iii) and (b). Assume there exists $\varepsilon > 0$ such that
$\phi_s \in \Phi_\varepsilon$ and $\phi_s \leq 1 - \varepsilon$ for all $s$. Set $a_s = \sup_{x \in \mathbb{X}} \int \phi_s(x,y)\mu(dy)$. By (6.3),

$$E N_k(G(\mathcal{P}_s,\phi_s)) = \Theta(s^k a_s^{k-1}) \exp(-\Theta(sa_s)). \tag{6.8}$$

If $sa_s$ remains bounded away from zero and infinity, then by (6.8) we have that
$E N_k(G(\mathcal{P}_s,\phi_s)) \to \infty$, contradicting (6.4). Therefore for any sequence of values of
$s$ tending to infinity, there is a subsequence such that either $sa_s \to 0$ or $sa_s \to \infty$
as $s \to \infty$ along the subsequence.

Consider first the case with $sa_s \to 0$. In this case, by (6.8) and (6.4) we have
$k \geq 2$ and $s^k a_s^{k-1} = \Theta(1)$. Recalling that $H_k(G)$ denotes the number of connected
induced subgraphs of a graph $G$ of order $k$, we have

$$E H_{k+1}(G(\mathcal{P}_s,\phi_s)) = O(s^{k+1} a_s^{k}) = o(1).$$

Since $0 \leq H_k(G(\mathcal{P}_s,\phi_s)) - N_k(G(\mathcal{P}_s,\phi_s)) \leq (k+1) H_{k+1}(G(\mathcal{P}_s,\phi_s))$, we have

$$E[H_k(G(\mathcal{P}_s,\phi_s)) - N_k(G(\mathcal{P}_s,\phi_s))] \to 0,$$

so by (6.4) we have that $E H_k(G(\mathcal{P}_s,\phi_s)) \to \alpha$. Hence by Theorem 2.3 we have
$H_k(G(\mathcal{P}_s,\phi_s)) \xrightarrow{\mathcal{D}} Z_\alpha$. Also, by Markov's inequality, $P[H_k(G(\mathcal{P}_s,\phi_s)) - N_k(G(\mathcal{P}_s,\phi_s)) \geq 1] \to 0$,
so we also have $N_k(G(\mathcal{P}_s,\phi_s)) \xrightarrow{\mathcal{D}} Z_\alpha$, which is the first part of (2.3).
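The deterministic inequality $0 \leq H_k(G) - N_k(G) \leq (k+1)H_{k+1}(G)$ used above can be checked on small graphs by brute force. The sketch below is illustrative only (hypothetical function names, exhaustive enumeration of $k$-subsets): it counts the connected induced subgraphs of order $k$ and the components of order $k$ of a finite graph.

```python
from itertools import combinations


def _connected(vertices, adj):
    """True if the subgraph induced by `vertices` is connected."""
    vertices = set(vertices)
    start = next(iter(vertices))
    seen, stack = {start}, [start]
    while stack:
        v = stack.pop()
        for w in (adj[v] & vertices) - seen:
            seen.add(w)
            stack.append(w)
    return seen == vertices


def count_H_and_N(n_vertices, edges, k):
    """Return (H_k, N_k) for the graph on {0, ..., n_vertices - 1}."""
    adj = {v: set() for v in range(n_vertices)}
    for i, j in edges:
        adj[i].add(j)
        adj[j].add(i)
    H = N = 0
    for subset in combinations(range(n_vertices), k):
        if not _connected(subset, adj):
            continue
        H += 1
        inside = set(subset)
        # A k-component is a connected k-subset with no edge leaving it.
        if all(adj[v] <= inside for v in subset):
            N += 1
    return H, N


if __name__ == "__main__":
    # A path 0-1-2-3 together with a separate edge 4-5.
    edges = [(0, 1), (1, 2), (2, 3), (4, 5)]
    print(count_H_and_N(6, edges, 2), count_H_and_N(6, edges, 3))
```

Each connected $k$-subset that is not a component extends to a connected $(k+1)$-subset, and each connected $(k+1)$-subset arises this way from at most $k+1$ subsets, which is the reason for the factor $k+1$.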

Suppose now that $sa_s \to \infty$. There exists $K < \infty$ such that if $sa_s \geq K\log s$
then by (6.8),

$$E N_k(G(\mathcal{P}_s,\phi_s)) = O(s(sa_s)^{k-1} \times e^{-\Theta(sa_s)} \times s^{-2}),$$

which tends to zero, contradicting (6.4). But also if there exists $\delta > 0$ such that
$sa_s \leq \delta\log s$, then there is a constant $\delta' > 0$ such that

$$E N_k(G(\mathcal{P}_s,\phi_s)) \geq \delta' s(sa_s)^{k-1} s^{-1/2},$$

which tends to infinity, contradicting (6.4). Therefore we must have

$$sa_s = \Theta(\log s). \tag{6.9}$$

We seek to apply Theorem 3.1. For $s > 0$, let $\eta_s, Z_s, \tau, \tau_1, \tau_2, \ldots$ be as in Section
4. Let $f_s$ be the function $f$ considered in Lemma 4.2 using the connection function
$\phi = \phi_s$. That is, let $f_s(x_1,t_1,\ldots,x_k,t_k,\xi)$ be the indicator of the statement that
$\{x_1,\ldots,x_k\}$ induces a component of $G_{\phi_s}(\cup_{i=1}^{k}\{(x_i,t_i)\} \cup \xi)$. Then with $F_s$ denoting
the function $F$ obtained by taking $f = f_s$ in the definition (3.2), we have that
$F_s(\eta_s) = N_k(G_{\phi_s}(\eta_s))$ which has the same distribution as $N_k(G(\mathcal{P}_s,\phi_s))$.
For $\mathbf{x} = (x_1,\ldots,x_k) \in \mathbb{X}^k$, set $\mathcal{X} = \{x_1,\ldots,x_k\}$ (allowing multiplicities). Define
the graph

$$\mathcal{G}_s := G_{\phi_s}(\eta_s \cup \{(x_1,\tau_1),\ldots,(x_k,\tau_k)\}). \tag{6.10}$$

Let $\mathcal{P}_{s,\mathbf{x}}$ be the set of points of $\mathcal{P}_s$ connected to at least one point of $\mathcal{X}$ in $\mathcal{G}_s$ and
let $\mathcal{P}_s^{\mathbf{x}} := \mathcal{P}_s \setminus \mathcal{P}_{s,\mathbf{x}}$.

The subgraph of $\mathcal{G}_s$ induced by vertex set $\mathcal{P}_s$ has the same distribution as
$G(\mathcal{P}_s,\phi_s)$, and we shall refer to this subgraph as $G'(\mathcal{P}_s,\phi_s)$. Likewise, we refer
to the subgraph of $\mathcal{G}_s$ induced by vertex set $\mathcal{P}_s^{\mathbf{x}}$ as $G'(\mathcal{P}_s^{\mathbf{x}},\phi_s)$, and we refer to the
subgraph of $\mathcal{G}_s$ induced by vertex set $\mathcal{X}$ as $G'(\mathcal{X},\phi_s)$.

Let $U_{\mathbf{x}} = N_k(G'(\mathcal{P}_s,\phi_s))$. This has the same distribution as $F_s(\eta_s)$.

Let $V_{\mathbf{x}} = N_k(G'(\mathcal{P}_s^{\mathbf{x}},\phi_s))$. We claim that this has the same distribution as the
conditional distribution of $F_s(\cup_{i=1}^{k}\{(x_i,\tau_i)\} \cup \eta_s) - 1$ given that $f_s(\cup_{i=1}^{k}\{(x_i,\tau_i)\},\eta_s) = 1$.
This is because by the Marking Theorem for Poisson processes (see e.g. m ),
the point processes $\mathcal{P}_{s,\mathbf{x}}$ and $\mathcal{P}_s^{\mathbf{x}}$ are independent and the statement that $\mathcal{G}_s$ has $\mathcal{X}$
as the vertex set of a component is equivalent to the statement that (i) $\mathcal{P}_{s,\mathbf{x}}$
has no points and (ii) the graph $G'(\mathcal{X},\phi_s)$ is connected, which is independent of the
outcome of $G'(\mathcal{P}_s^{\mathbf{x}},\phi_s)$.

Also, $U_{\mathbf{x}} - V_{\mathbf{x}} = U'_{\mathbf{x}} - V'_{\mathbf{x}}$, where $U'_{\mathbf{x}}$ denotes the number of $k$-components of
$G'(\mathcal{P}_s,\phi_s)$ with at least one vertex in $\mathcal{P}_{s,\mathbf{x}}$, and $V'_{\mathbf{x}}$ is the number of $k$-components
of $G'(\mathcal{P}_s^{\mathbf{x}},\phi_s)$ with at least one neighbour in $\mathcal{P}_{s,\mathbf{x}}$.

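By the Mecke formula

$$E U'_{\mathbf{x}} = \frac{s^k}{k!} \int \phi_s(\mathbf{x},\mathbf{y})\, h_{\phi_s}(\mathbf{y}) \exp\Big(-s\int \phi_s(z,\mathbf{y})\mu(dz)\Big)\mu^k(d\mathbf{y}),$$

where $\phi_s(\mathbf{x},\mathbf{y})$ is given by (6.1). Then since we can choose the elements $y_1,\ldots,y_k$ of
$\mathbf{y}$ in an order such that $y_1$ is connected to $\mathcal{X}$ and for each $j \geq 2$, $y_j$ is connected to
$\{y_1,\ldots,y_{j-1}\}$, we have

$$E U'_{\mathbf{x}} = O(s^k a_s^k) \times \exp(-\Theta(sa_s)), \tag{6.11}$$

which tends to zero (uniformly over $\mathbf{x}$).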

Now $V'_{\mathbf{x}}$ is bounded by the number of pairs $(y,\mathbf{z})$ with $y \in \mathcal{P}_{s,\mathbf{x}}$ and $\mathbf{z} = (z_1,\ldots,z_k)$
with $\{z_1,\ldots,z_k\}$ inducing a $k$-component of $G'(\mathcal{P}_s^{\mathbf{x}},\phi_s)$ and $\mathbf{z}$ connected
to $y$. Hence by the Mecke equation,

$$E V'_{\mathbf{x}} \leq s^{k+1} \int\!\!\int \phi_s(y,\mathbf{x})\,\phi_s(y,\mathbf{z})(1 - \phi_s(\mathbf{x},\mathbf{z}))\, h_{\phi_s}(\mathbf{z})
\exp\Big(-s\int \phi_s(w,\mathbf{z})(1 - \phi_s(w,\mathbf{x}))\mu(dw)\Big)\mu^k(d\mathbf{z})\mu(dy)
= O((sa_s)^{k+1}) \exp(-\Theta(sa_s)), \tag{6.12}$$

which tends to zero, uniformly over $\mathbf{x}$; here we have used the assumption that
$\phi_s \leq 1 - \varepsilon$, so that $\phi_s(w,\mathbf{z}) \leq 1 - \varepsilon^k$, for all $s, w, \mathbf{z}$. Therefore $E[|U_{\mathbf{x}} - V_{\mathbf{x}}|] = E[|U'_{\mathbf{x}} - V'_{\mathbf{x}}|] \to 0$,
uniformly over $\mathbf{x} \in \mathbb{X}^k$. Then we can use Theorem 3.1 to get the
first part of (2.3).

Before completing the proof of part (a) (iii) of Theorem 2.2 we prove part (b),
so now instead of (6.4) we assume $\alpha_s := E N_k(G(\mathcal{P}_s,\phi_s)) \to \infty$, but $\alpha_s = o(s)$. Then
by (6.8), for every sequence of values of $s$ tending to infinity, there is a subsequence
such that either $sa_s \to \infty$ or $k \geq 2$ and $sa_s \to 0$ as $s \to \infty$ along the subsequence.
In both cases, the estimates (6.11) and (6.12) hold so by Theorem 3.1 we have
$d_{TV}(\mathcal{L}(N_k(G(\mathcal{P}_s,\phi_s))), \mathcal{L}(Z_{\alpha_s})) \to 0$. Since also $(Z_{\alpha_s} - \alpha_s)/\sqrt{\alpha_s} \xrightarrow{\mathcal{D}} \mathcal{N}$, it follows
that $(N_k(G(\mathcal{P}_s,\phi_s)) - \alpha_s)/\sqrt{\alpha_s} \xrightarrow{\mathcal{D}} \mathcal{N}$. This completes the proof of part (b) (ii).

Now we return to part (a), so we go back to assuming (6.4). As in the corresponding
part of the proof of Theorem 2.1, for $n \in \mathbb{N}$ set $s(n) = n - n^{3/4}$ and
$t(n) = n + n^{3/4}$. Then $\mathcal{P}_{s(n)} \subset \mathcal{X}_n \subset \mathcal{P}_{t(n)}$ with high probability, and also the
point process $\mathcal{P}_{t(n)} \setminus \mathcal{P}_{s(n)}$ is a Poisson point process with mean measure $2n^{3/4}\mu(\cdot)$,
independent of $\mathcal{P}_{s(n)}$. By (6.9),

$$\exp\Big(n^{3/4}\int \phi_n(y,x)\mu(dy)\Big) = \exp(n^{-1/4} \times O(\log n)) = 1 + o(1),$$

uniformly over $x \in \mathbb{X}$, and therefore by (6.4) the sequence $(\phi_n)_{n \in \mathbb{N}}$ satisfies

$$\frac{(s(n))^k}{k!} \int \exp\Big(-s(n)\int \phi_n(z,\mathbf{x})\mu(dz)\Big) h_{\phi_n}(\mathbf{x})\mu^k(d\mathbf{x}) \to \alpha. \tag{6.13}$$


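For $n, \ell \in \mathbb{N}$ with $1 \leq \ell \leq k$, let $A_{n,\ell}$ be the event that at least one collection of
$\ell$ of the added vertices of $\mathcal{X}_n \setminus \mathcal{P}_{s(n)}$ lies in a $k$-component of $G(\mathcal{X}_n,\phi_n)$. Let $B_n$ be
the event that at least one of the added vertices of $\mathcal{P}_{t(n)} \setminus \mathcal{P}_{s(n)}$ is connected to one
of the $k$-components of $G(\mathcal{P}_{s(n)},\phi_n)$.

If $A_{n,\ell}$ occurs and $\mathcal{P}_{s(n)} \subset \mathcal{X}_n \subset \mathcal{P}_{t(n)}$, then there is at least one pair $(\mathcal{X},\mathcal{Y})$,
such that $\mathcal{X} \subset \mathcal{P}_{s(n)}$ has $k - \ell$ elements, and $\mathcal{Y} \subset \mathcal{P}_{t(n)} \setminus \mathcal{P}_{s(n)}$ has $\ell$ elements, and
$\mathcal{X} \cup \mathcal{Y}$ induces a connected subgraph of $G(\mathcal{P}_{t(n)},\phi_n)$, and there is no connection
between any vertex of $\mathcal{X} \cup \mathcal{Y}$ and any vertex of $\mathcal{P}_{s(n)} \setminus \mathcal{X}$ (however, we do allow other
connections between vertices of $\mathcal{X} \cup \mathcal{Y}$ and other vertices of $\mathcal{P}_{t(n)} \setminus \mathcal{P}_{s(n)}$). By the
Mecke equation, the expected number of such pairs equals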


$$\frac{(2n^{3/4})^{\ell}}{\ell!}\, \frac{(s(n))^{k-\ell}}{(k-\ell)!} \int\!\!\int h_{\phi_n}(x_1,\ldots,x_{k-\ell},y_1,\ldots,y_\ell)
\exp\Big(-s(n)\int \phi_n(z,\{x_1,\ldots,x_{k-\ell},y_1,\ldots,y_\ell\})\mu(dz)\Big)
\mu^{k-\ell}(d(x_1,\ldots,x_{k-\ell}))\,\mu^{\ell}(d(y_1,\ldots,y_\ell)),$$

which tends to zero by (6.13), so $P[A_{n,\ell}] \to 0$. Also, the expected number of $k$-components
in $G(\mathcal{P}_{s(n)},\phi_n)$ which are connected to at least one vertex of $\mathcal{P}_{t(n)} \setminus \mathcal{P}_{s(n)}$
is at most

$$\frac{(s(n))^k}{k!} \int \exp\Big(-s(n)\int \phi_n(z,\mathbf{x})\mu(dz)\Big) h_{\phi_n}(\mathbf{x})\, 2n^{3/4} k a_n\, \mu^k(d\mathbf{x}),$$

and by (6.13) and (6.9) this tends to zero. Hence $P[B_n] \to 0$. By the first part of
(2.3) we have for $k \in \mathbb{N}$ that $N_k(G(\mathcal{P}_{s(n)},\phi_n)) \xrightarrow{\mathcal{D}} Z_\alpha$. Also

$$P[N_k(G(\mathcal{X}_n,\phi_n)) \neq N_k(G(\mathcal{P}_{s(n)},\phi_n))] \leq P[\cup_{\ell=1}^{k} A_{n,\ell}] + P[B_n] + P[\{Z_{s(n)} \leq n \leq Z_{t(n)}\}^c],$$

which tends to 0, and the second part of (2.3) follows. □


7 Number of edges 


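For any graph $G$, according to our earlier notation $H_2(G)$ denotes the number of
edges of $G$. Let $\phi \in \Phi$. Then

$$E H_2(G(\mathcal{P}_s,\phi)) = E H_2(G_\phi(\eta_s)) = \frac{1}{2} \int_{\mathbb{X}}\int_{\mathbb{X}} \phi(x,y)\, s^2 \mu(dx)\mu(dy). \tag{7.1}$$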

Theorem 7.1. Suppose $\phi \in \Phi$. Set $\alpha := E H_2(G(\mathcal{P}_s,\phi))$. Then

$$d_{TV}(H_2(G(\mathcal{P}_s,\phi)), Z_\alpha) \leq (1 \wedge \alpha^{-1}) \int \Big(\int \phi(x,y)\, s\mu(dy)\Big)^2 s\mu(dx). \tag{7.2}$$


Proof. We shall use Theorem 3.1. Given $x, y \in \mathbb{X}$ with $\phi(x,y) > 0$, set $\mathcal{G}_s =
G_{\phi_s}(\eta_s \cup \{(x,\tau_1),(y,\tau_2)\})$. Let $G'(\mathcal{P}_s,\phi_s)$ denote the subgraph of $\mathcal{G}_s$ induced by the
vertex set $\mathcal{P}_s$, which has the same distribution as $G(\mathcal{P}_s,\phi_s)$.

Set $U_{x,y} = H_2(G'(\mathcal{P}_s,\phi_s))$ and $V_{x,y} = H_2(\mathcal{G}_s) - \mathbf{1}\{\{x,y\} \in E(\mathcal{G}_s)\}$, where $E(G)$
denotes the set of edges of a graph $G$. Then $V_{x,y}$ has the conditional distribution of
$H_2(\mathcal{G}_s) - 1$ given that $\{x,y\} \in E(\mathcal{G}_s)$. Also $V_{x,y} \geq U_{x,y}$ and

$$E[V_{x,y} - U_{x,y}] = \int (\phi(x,z) + \phi(y,z))\, s\mu(dz) =: w(x,y).$$

Hence by Theorem 3.1 the left hand side of (7.2) is bounded by the expression

$$\frac{1}{2}(1 \wedge \alpha^{-1}) \int\!\!\int w(x,y)\phi(x,y)\, s^2 \mu(dx)\mu(dy)
= (1 \wedge \alpha^{-1}) \int \Big(\int \phi(x,y)\, s\mu(dy)\Big)^2 s\mu(dx)$$

as required. □


For example, consider the geometric setting with $\mu$ having bounded, almost
everywhere continuous density $f$ with respect to Lebesgue measure on $\mathbb{R}^d$. Suppose
$\phi_s(x,y) = \phi(r_s^{-1}(x-y))$ for some fixed integrable and almost everywhere continuous
$\phi$, and $r_s > 0$ satisfying $s^2 r_s^d \to \beta$ for some $\beta > 0$. Set $\alpha_s := E H_2(G(\mathcal{P}_s,\phi_s))$. Then
by (7.1),

$$\alpha_s \to \frac{\beta}{2} \Big(\int \phi(z)dz\Big) \int f(x)^2 dx =: \alpha.$$

By (7.2) we have $d_{TV}(\mathcal{L}(H_2(G(\mathcal{P}_s,\phi_s))), \mathcal{L}(Z_{\alpha_s})) = O(s^3 r_s^{2d}) = O(s^{-1})$.

In particular $H_2(G(\mathcal{P}_s,\phi_s)) \xrightarrow{\mathcal{D}} Z_\alpha$. This could possibly also be proved by deriving
a Poisson limit for $H_2(G(\mathcal{X}_n,\phi_n))$ by adapting the argument in (TSJ Theorem
3.4] to the RCM, and Poissonizing. However, the Poissonization would seem to
introduce an error of at least $s^{-1/2}$ in the total variation distance, so the rate of
convergence would not be as good.
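To see the $O(s^{-1})$ rate numerically in the simplest possible case, the sketch below evaluates the right-hand side of the bound in Theorem 7.1, read as $(1 \wedge \alpha^{-1}) \int (\int \phi(x,y) s\mu(dy))^2 s\mu(dx)$, for a constant connection function $\phi \equiv p$. The constant $\phi$ is an assumption made purely for illustration (it is not the geometric example above, and whether it belongs to the class $\Phi$ is not checked here); the function name is hypothetical.

```python
def edge_count_tv_bound(s: float, p: float) -> float:
    """Bound (1 ^ 1/alpha) * int (int phi s dmu)^2 s dmu for phi(x, y) = p.

    Illustrative assumption: phi is constant, so every inner integral equals s * p.
    """
    alpha = 0.5 * s * s * p  # mean number of edges, (1/2) s^2 p
    inner = s * p            # integral of phi(x, y) s mu(dy), the same for every x
    return min(1.0, 1.0 / alpha) * inner ** 2 * s


if __name__ == "__main__":
    beta = 1.0
    for s in (10.0, 100.0, 1000.0):
        p = 2.0 * beta / s ** 2  # keeps the mean number of edges equal to beta
        print(s, edge_count_tv_bound(s, p))
```

With $s^2 p = 2\beta$ held fixed, the bound equals $(1 \wedge \beta^{-1})\, 4\beta^2 / s$, so it decays like $1/s$, in line with the rate stated for the geometric setting.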


Proof of Theorem 2.3. Let $k \in \mathbb{N}$ with $k \geq 2$. Let $\varepsilon > 0$ and assume $\phi_s \in \Phi_\varepsilon$ for
all $s > 0$. Set $G_s := G(\mathcal{P}_s,\phi_s)$. Set $a_s = \sup_{x \in \mathbb{X}} \int \phi_s(x,y)\mu(dy)$. Then with $h_\phi(\cdot)$
defined at (6.2), we have

$$E H_k(G_s) = \int \frac{s^k}{k!}\, h_{\phi_s}(x_1,\ldots,x_k)\mu^k(d(x_1,\ldots,x_k)) = \Theta(s^k a_s^{k-1}), \tag{7.3}$$

because for each connected graph $\Gamma$ on $\{1,\ldots,k\}$ we can integrate the variables
$x_1,\ldots,x_k$ in an order $(x_{\sigma(1)},\ldots,x_{\sigma(k)})$ such that for $2 \leq i \leq k$ each successive $\sigma(i)$
is connected in $\Gamma$ to one or more of $\sigma(1),\ldots,\sigma(i-1)$. Then each successive integral
gives another factor of $\Theta(a_s)$.

Let $\mathbf{x} = (x_1,\ldots,x_k) \in \mathbb{X}^k$, with $x_1,\ldots,x_k$ distinct and with $h_{\phi_s}(x_1,\ldots,x_k) > 0$.
Let $\mathcal{G}_s$ be the graph defined by (6.10), but now conditioned on the subgraph induced
by $\{x_1,\ldots,x_k\}$ being connected. Denote by $G'(\mathcal{P}_s,\phi_s)$ the subgraph of $\mathcal{G}_s$ induced
by $\mathcal{P}_s$. Set $U_{\mathbf{x}} = H_k(G'(\mathcal{P}_s,\phi_s))$ and $V_{\mathbf{x}} = H_k(\mathcal{G}_s) - 1$. Then $U_{\mathbf{x}}$ has the distribution
of $H_k(G_s)$ and $V_{\mathbf{x}}$ has the conditional distribution of $H_k(G(\mathcal{P}_s \cup \{x_1,\ldots,x_k\},\phi_s))$
given that $\{x_1,\ldots,x_k\}$ induces a connected subgraph of this graph.

Now $V_{\mathbf{x}} \geq U_{\mathbf{x}}$ and we assert that


E [V x 


u x } = 0 



(7.4) 


uniformly over $\mathbf{x} \in \mathbb{X}^k$. To see this, observe that $V_{\mathbf{x}} - U_{\mathbf{x}}$ is the number of pairs
$(\mathcal{X},\mathcal{Y})$ with $\mathcal{X}$ a non-empty subset of $\{x_1,\ldots,x_k\}$ and $\mathcal{Y}$ a non-empty subset of
$\mathcal{P}_s$, such that the subgraph of $\mathcal{G}_s$ induced by vertex set $\mathcal{X} \cup \mathcal{Y}$ is a connected graph
of order $k$ (to ease notation we ignore the issue of multiplicities in this notation).
But then we can take the successive elements $y_i$ of $\mathcal{Y}$ in an order such that each of
them is connected to at least one existing vertex from $\mathcal{X} \cup \{y_1,\ldots,y_{i-1}\}$. Then each
successive integral gives another factor of $O(sa_s)$.

For part (i), assume $E H_k(G_s) \to \alpha \in (0,\infty)$. Then by (7.3) and (7.4) we have
$E|V_{\mathbf{x}} - U_{\mathbf{x}}| = o(1)$, uniformly over $\mathbf{x} \in \mathbb{X}^k$. Hence by Theorem 3.1, $H_k(G_s) \xrightarrow{\mathcal{D}} Z_\alpha$,
which is part (i).

For part (ii), set $\alpha_s = E H_k(G_s)$, and assume $\alpha_s \to \infty$ but $\alpha_s = o(s)$. Then by
(7.3), the assumption $\alpha_s = o(s)$ implies that $sa_s \to 0$. Then by (7.4) and Theorem
3.1 we have that $d_{TV}(\mathcal{L}(H_k(G_s)), \mathcal{L}(Z_{\alpha_s})) \to 0$. Since also $(Z_{\alpha_s} - \alpha_s)/\sqrt{\alpha_s} \xrightarrow{\mathcal{D}} \mathcal{N}$,
we therefore have $(H_k(G_s) - \alpha_s)/\sqrt{\alpha_s} \xrightarrow{\mathcal{D}} \mathcal{N}$. □


8 Poisson U-statistics 

Let $k \in \mathbb{N}$, and let $S_k(\mathbb{X}) := \{\xi \in S(\mathbb{X}) : |\xi| = k\}$. Let $h : S_k(\mathbb{X}) \to \{0,1\}$ be
measurable. For $\xi \in S(\mathbb{X})$ set

$$F(\xi) := \sum_{\psi \subset \xi :\, |\psi| = k} h(\psi).$$

Let $s > 0$, and let $\eta$ be a Poisson process on $\mathbb{X}$ with mean measure $\lambda := s\mu$. We seek
to apply Theorem 3.1 to $W := F(\eta)$ for this choice of $F$, the so-called Poisson U-statistic.
Assume $\mu$ is diffuse; as is the case elsewhere in this paper, this assumption
is for notational convenience only.
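As a concrete reading of the definition of $F$, the following sketch evaluates a U-statistic on a finite point configuration by summing a $\{0,1\}$-valued kernel over all $k$-element subsets. The kernel `all_within` (all points of the subset lie within distance $r$ of each other on the real line) is a hypothetical choice made only for illustration, as are the function names.

```python
from itertools import combinations


def u_statistic(points, k, h):
    """F(xi): sum of h(psi) over all k-element subsets psi of the configuration."""
    return sum(h(subset) for subset in combinations(points, k))


def all_within(r):
    """Hypothetical kernel: 1 if all pairwise distances on the line are <= r, else 0."""
    def h(subset):
        return int(max(subset) - min(subset) <= r)
    return h


if __name__ == "__main__":
    xi = [0.0, 0.1, 0.25, 0.9]
    print(u_statistic(xi, 2, all_within(0.2)))  # number of pairs at distance <= 0.2
    print(u_statistic(xi, 3, all_within(0.3)))  # number of triples of diameter <= 0.3
```

For $k = 2$ and this kernel, $F(\eta)$ is the number of edges of a hard-threshold geometric graph on the points of $\eta$.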

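For $\mathbf{x} = (x_1,\ldots,x_k) \in \mathbb{X}^k$ with $h(\{x_1,\ldots,x_k\}) = 1$ we set $U_{\mathbf{x}} = F(\eta)$, and

$$V_{\mathbf{x}} = F(\eta \cup \{x_1,\ldots,x_k\}) - 1.$$

Clearly $U_{\mathbf{x}}$ and $V_{\mathbf{x}}$ have the required distributional properties in the statement of
Theorem 3.1. Also $V_{\mathbf{x}} \geq U_{\mathbf{x}}$, and

$$|V_{\mathbf{x}} - U_{\mathbf{x}}| = \sum_{J \subset [k],\, 1 \leq |J| < k}\ \sum_{\psi \subset \eta :\, |\psi| = k - |J|} h(\psi \cup \{x_i : i \in J\}),$$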
so using the Mecke formula and integrating over $\mathbf{x}$ we obtain that

$$\int (E|V_{\mathbf{x}} - U_{\mathbf{x}}|)\, h(\mathbf{x})\lambda^k(d\mathbf{x})
= \sum_{\ell=1}^{k-1} \frac{1}{(k-\ell)!}\binom{k}{\ell} \int\!\!\int h(\{x_1,\ldots,x_\ell,y_1,\ldots,y_{k-\ell}\})
\lambda^{k-\ell}(d(y_1,\ldots,y_{k-\ell}))\, h(x_1,\ldots,x_k)\lambda^k(d\mathbf{x})$$

$$= \sum_{\ell=1}^{k-1} \frac{k!}{\ell!((k-\ell)!)^2} \int \Big(\int h(\{x_1,\ldots,x_\ell,y_1,\ldots,y_{k-\ell}\})\lambda^{k-\ell}(d(y_1,\ldots,y_{k-\ell}))\Big)^2
\lambda^{\ell}(d(x_1,\ldots,x_\ell))
=: \gamma(h,\lambda).$$

By Theorem 3.1, if $E F(\eta) = \alpha$, then $d_{TV}(\mathcal{L}(F(\eta)), \mathcal{L}(Z_\alpha)) \leq (1 \wedge \alpha^{-1})\gamma(h,\lambda)/k!$.
Also $d_W(\mathcal{L}(F(\eta)), \mathcal{L}(Z_\alpha)) \leq 3(1 \wedge \alpha^{-1/2})\gamma(h,\lambda)/k!$. This bound is comparable to
the one obtained in Theorem 7.1 of [5]. Our bound has an extra factor $1 \wedge \alpha^{-1/2}$ in
front, which may make it better when $\alpha$ is large. Also, unlike [5] we do not make
any topological assumptions on the measurable space $\mathbb{X}$. As remarked just after the
statement of Theorem 3.1, it is possible to extend that result to the case where the
measure $\lambda$ is $\sigma$-finite, and hence to extend the above argument likewise, but we do
not go into details here.